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The present invention generally relates to the production of bioactive 



compounds such as ectemascidins and methods of making such ecteinascidins 
using a variety of methods. In particular, bacterial symbionts of the ascidian 
Ecteinascidia iurbimta, its proteins or genes can be used in such methods. 



The ectemascidins are a group of bioactive tetrahydroisoqnmoiine 
ill al< »ids isolated from the Caribbean mangrove ascidiaa Ecteinascidia turbinata 
(Rinehart et al., 1990 and Wright et al., 1990). The most active of these 
compounds, ecteinascidm 743 (Et-743), (Compound 1) shows remarkable 

15 potency in both in vitro and in vivo bioassays, and possesses picomolar activity 
against breast, colon, non-small cell lung carcinoma (NSCLC), renal, ovarian, 
and melanoma cell lines (Zewaii-Foote et al, 1999). The potent activity of Et- 
743 and its clinical potential places Et-743 in a small group of marine 
invertebrate-derived compounds, including diseodermolide (Givnasekera et al,, 

20 1990), bryostatms (Pettit et al., 1982) and eleutherobin (Lindel et al., 1997). 
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Background of the Invention 
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6 jorumycin 7 Demerol 8 mimosamycin 

Et-743 binds to the minor groove of DNA and specifically alkylates the 
N2 of guanine (Poinniier et ai., 1996 and Moore et al, 1997), The drug differs 
from the other minor groove-interactive compounds because it induces a bend in 
5 the DNA helix toward the major groove (Zewail-Foote et ah, 1999). Et-743 
shows anti-tumor activity with IC^s in the nanomolar range and is related to a 
large number of less potent compounds (Rinehart et al s 1990 and Faircloth et al, 
1996). Et-743 is in phase I ehnical trials in the United States and Europe and is 
in phase H cikiical trials in France (Valori et al, 1988 and Rinehart, 2001). 
1 0 Mechanistically, Et-743 disrupts the microtubule network and binds 

substantially irreversibly to 5-GGG, S'-GGC and 5*-AGC DNA sequences that 
leads to cell cycle arrest in the late S and G2/M phase (Garcia-Rocha et ai, 1996; 
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Moire et al, 3 997; Pommier et al, 1996; Morre et al, 1 998; and Faircloth et al., 
1996), 

Structurally, Et-743 is similar to the renieramycins (He et al. } 1989 and 
Frmcke et al, 1982) from the marine sponge Reniera sp; It shows remarkable 
5 structural similarity to the less active saframyek antibiotics, (Compounds 2-3), 
respectively, isolated from the bacteria Streptomyces lavendulae (Arai et at, 
1983), Pseudomonas fluoresceins (Mews et al., 1983) and Myxococcus xanthus 
(Irschik et al., 1988), as well as the renieramycins (Compound 4) and cribrostatin 
4 (Compound 5) isolated from the marine sponges Reniera sp. and 

10 O-ibrochalina sp., respectively (Frmcke et al, 1982 and Perth et al, 2000). 
Saframyein derivates are produced in culture and synthetic routes exist for 
derivatives of most of the isoquinolone antibiotics. 

Jorumycin (Compound 6) was isolated from the Indian nudibranch 
JonmnafunebrU that was found "associated with" a blue sponge, identified as 

15 Oceanapia sp. (Fontana et al, 2000). The monomelic isoquinohnes 

(Compounds 7 and 8) were also found in the Oceanapia sponge, as well as 
culture broths of 5. lavendulae, and a variety of other sponges (Frmcke and 
Faulkner, 1982; McKee and Ireland, 1987; Parameswaran et al, 1998). It is 
believed that compounds such as Compounds 7 and 8 are the result of oxidative 

20 cleavage of the dimeric compounds and probably arise from the isolation and 
extraction procedure (Frincke and Faulkner, 1982). Et-743 differs from 
compounds 2-6 primarily by the presence of the C ring sub-unit system, which is 
critical for its high potency and tumor selectivity (Sakai et al., 1996). 

Et-743 is expensive, since natural marine products are typically difficult 

25 to obtain in sufficient quantity for clinical use. Although aquacultuxe currently 
provides a source for Bt-743, it yields only a small amount of the compound per 
mass of invertebrate (me average yield of the drug from the ascidian are 
generally low (0.01-0.02 % dry weight)). Until recently, the most efficient 
method of Et-743 production was aquacufture ofE, iurbinata (Corey et al., 

30 1996). Efforts have been made to produce Et-743 more economically by 

synthetic processes (Martinez and Corey, 2000; Zhou et al, 2000; Herberich et 
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al.j 2001). A. relatively efficient chemical synthesis exists for the compound but 
is a less promising source than aquaculture {Corey et al. } 1996). Cuevas et al 
reported efficient semi-synthesis of Et-743 from cyanosafracin B (Compound 3) 
whicii can be easily obtained in kilogram quantities from culture of 
5 Pseudomonasjlnorescms (Cuevas et al., 2000). An attractive alternative to 
aquaculture or even semi-synthesis may be the cloning of the biosynthetic genes 
into a more tractable host, such as P. fluorescent. Thus, there is a need for 
alternative methods for producing ectemascidms, such as Et-743. 

10 Brief Description of the Figures 

FIG. 1 depicts the structure of Ecteinascidin-743 (Et-743). 

FIG. 2 depicts the modification of L-Tyr. 

FIG. 3 depicts a scheme for the synthesis of Et-743. 

FIG. 4 depicts in situ hybridization of bacterial 16S rKNA probes to 
15 larvae of E. iurbinata. 

FIG. 5 depicts adult 2?. iurbinata showing brooding of eggs in lower 
portion of the zooid. (1 5X) (a); excised larvae at various development stages 
(20X)(b). 

FIG . 6 depicts (a) ESI mass spectrum from the £. iurbinata larva] extract; 
20 (b) MS/MS of the parent ion (744) showing loss of the C-ring subunit of Et-743 . 

FIG. 7 depicts DGGE gel of PCR products amplified with 1055f-1392r. 
1 ; E. turbinates 2, E. iurbinata, 3. £. colt Arrows point to the two hands that 
were excised and sequenced, 

FIG. B depicts phylogram obtained with maximum parsimony and lOOx 
25 bootstrapping (indicated by numbers). 

FIG, 9 depicts FISH experiments using fluorescein labeled probes. 
A, Domain Bacteria probe Eub338; B. no probe control; C E. palauensis probe 
1247r; D. E. sertuia probe 1253r. 

FIG. 10 depicts FISH micrographs of is, turbinaia larvae using all 
30 excitation wavelengths on a conJbcal niicroscope. A. Whole larva hybridized 
with CY5 labeled Eub338 (indicated by blue) 10X, scale bar - 100 pm. (B-D) 
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images of larvae taken near adhesive papilla© B. 40X, scale bar ~ 20 urn, C, 
63X, scale bar ~ 10 pm, D. 63X plus digital zoom, scale bar - 5 pro. 

FIG. 11 depicts PCR amplificatioii of B. mrbinata larval DNA using a 
combination of universal, Endobugtda specific and E. ,sert&I<z specific primers, 1 
5 =*£ turbinate larval DNA, 2*2?. pacifica DNA used as a control, 3 - negative 
control. A: usiversal bacterial primers (27f-1.492t)> B: universal with 
Endobugula specific (27f-855r), C: & serta/a specific (240f-1253r), D: 
universal witiiE. sertula specific (2?f-12S3r), 

FIG. 12 depicts light micrograph sections of£ turbinata larva through 
10 the adhesive papillae. A. 16X, scale bar = 70 um B. 40X, scale bar « 30 pro, 

SaiSjMIXof the Invartfea 

Methods provided herein include using bacteria that are involved at least 
in part in the synthesis of bioactive compounds. These bacteria can be used in 

15 production processes, such as fermentation, to produce valuable bioactive 

compounds. Alternatively, polypeptides present in such bacterial or nucleic acid 
molecules that encode such polypeptides can be used, to produce bioactive 
compounds in ex vivo processes or in recombinant organisms engineered to 
express such polypeptides. The present invention provides these arid related 

20 benefits. 

The present invention recognizes feat organisms such as bacteria contain 
nucleic acid molecules that encode polypeptides that catalyze the synthesis of 
bioactive compounds, such as ecteinascidins. 

25 One aspect of the present invention is a bacterium or bacteria that can be 

a pure or mixed culture, that at least hi part synthesizes at least one ectemascidin. 

A second aspect of the present invention is a preparation of bacteria from 
an Ecteinascidia, such as Ecteinascidia turbinata, that includes at least one 
activity involved in the synthesis of an ecteinascidia. 
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Nucleic Acids 

A third aspect of the present invention is a composition including at least 
one nucleic acid molecule that encodes at least one polypeptide that catalyzes at 
least one step in the synthesis of at least one eotemasctdin. 
5 A fourth aspect of the present invention is a composition including a 

library of nucleic acid molecules of the present invention. These nucleic acid 
molecules can be used in a combinatorial biosynthesis of ecteinascidms. 

A fifth aspect of the present invention is a composition including at least 
1 0 one polypeptide that catalyzes at least one step in the synthesis of at least one 
ect emaseidin, where preferably at least one polypeptide is derived from at least 
one marine organism, 

A sixth aspect of the present invention is a composition including a 
library of polypeptides of the present invention. 
15 Antibodies 

A seventh aspect of the present invention is an antibody that hinds with 
or specifically binds with one or more bacteria of the present invention. 

An eighth aspect of the present invention is an antibody that binds with 
or specifically binds with a polypeptide of the present invention, 
20 Method oj , tive cgm p ^nji s 

Bacteria 

A ninth aspect of the present invention is a method of making an 
eeteinascidin using a bacteria of the present invention, 

A tenth aspect of the present invention is a composition made using a 
25 bacteria of the present invention. 
Nucleic Acids 

An eleventh aspect of the present invention is a method of making a 
composition, including providing at least one nucleic acid of the present 
invention and synthesizing at least one eetemascidin. 
30 A twelfth aspect of the present invention is a composition made using a 

nucleic acid of die present invention, 
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Polype ptides 

A thirteenth aspect of the present invention is a method of making a 
composition including providing at least one polypeptide of the present invention 
and synthesiaing at least one ecteinascidm. 
5 A fourteenth aspect of the present invention is a composition made using 

a polypeptide of the present invention. 
. Method for identifimg nucleic acid molecules 

A fifteenth aspect of the present invention is a method for identifying at 
leasi one .nucleic acid molecule encoding at least one activity involved in the 
10 synthesis of an ecteinascidin, comprising contacting a nucleic acid molecule of 
the present invention with a sample, and identifying nucleic acid molecules in 
said sample mat hybridize with a nucleic acid of the present invention. 

A sixteenth aspect of the present invention is a nucleic acid molecule 
identified by a method of the present invention. 
15 A seventeenth aspect of the present invention is a composition 

comprising a library of nucleic acid molecules identified using a method of the 
present invention. 

Method for identifying bioactive compounds 
B acteria 

20 An eighteenth aspect of the present invention is a method for identifying 

a bioactive compound including contacting a compound made or identified using 
a bacteria of the present invention with a detection system and determining the 
bioactivity of the compound. 

A nineteenth aspect of the present inventi on is a compound identified by 

25 this method. 

Atwentieth aspect of the present invention is a method for identifying a 
bioactive compound including contacting a compound made or identified using a 
nucleic acid molecule of the present invention with a detection system and 
3 0 deterauning the bioactivity of the compound. 
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A tiventy-first aspect of fee present invention i s a compound identified by 
this method. 

Polype ptide 

A twenty-second aspect of the present invention is a method for 
5 identifying a bioaetive compound including contacting a compound made or 
identified using a polypeptide of the present invention with a detection system 
and determining the bioactivity of said compound, 

A twenty-third aspect of the present invention is a compound identifi ed 
by this method. 

10 

Betaflgd description of the Invgaiioj* 

Definitions 

Unless defined otherwise, all technical and scientific terms used herein 
have the same meaning as commonly understood by one of ordinary skill in the 

15 art to which tins invention belongs. Generally, the nomenclature used herein and 
the laboratory procedures in cell culture, chemistry, microbiology, molecular 
biology, cell science and cell culture described below are well known and 
commonly employed in the art. Conventional methods are used for these 
procedures, such as those provided in the art and various general references 

20 (Sambrook et at, 1989; Ausubei et al> 1998; Harlowe and Lane, 1 988). mere a 
term is provided in the singular, the inventors also contemplate the plural of that 
term. The nomenclature used herein and the laboratory procedures described 
below are those well known and commonly employed in the ait. As employed 
throughout the disclosure,, the following terms, unless otherwise indicated, shall 

25 be understood to have the following meanings : 

"Membrane permeant derivative" refers to a chemical derivative of a 
compound that increases membrane permeability of the compound. These 
derivatives are made better able to cross cell membranes because hydrophilie 
groups are masked to provide more hydrophobic derivatives. Also, the making 

30 groups can be designed to be cleaved front the compound within a cell to make 
the compound more hydrophilie ones within the cell. Because the substrate is 
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more hydrophilic &an the membrane permeant derivative, it preferentially 
localizes within the cell {U.S. Patent Ho. 5,741,657 to Tsien et at, issued 
April 21,1998). 

"Isolated polynucleotide" refers to a polynucleotide of genomic, eDNA, 
5 PCR or svnthetic origin, or some coi binat n there f, which by virtue of its 
origin, the isolated polynucleotide (1) is not associated with the eel! in which the 
isolated polynucleotide is found in nature, or (2) is operably linked to a 
polynucleotide that it is not linked to in nature, The isolated polynucleotide can. 
optionally be linked to promoters, enhancers, or other regulator)' sequences. 

1 0 "Isolated protein" refers to a protein of cDNA, recombinant RNA, or 

synthetic origin, or some combination thereof, which by virtue of its origin the 
isolated protein (1) is not associated with proteins normally found within nature, 
or (2) is isolated from the ceil in which it normally occurs, or (3) is isolated free 
of other proteins from the same cellular source, for example, free of cellular 

1 5 proteins), or (4) is expressed by a cell from a different species, or (5) does not 
occur in nature. 

"Polypeptide" is used herein as a generic term to refer to native protein, 
fragments, or analogs of a polypeptide sequence. 

"Active fragmen t" refers to a fragment of a parent molecule, such as an 
20 organic molecule, nucleic acid molecule, or protein or polypeptide, or 

combinations thereof that retains at least one activity of the parent molecule. 

"Naturally occurring" refers to the fact that air object can be found in 
nature. For example, a polypeptide or polynucleotide sequence that is present in 
an organism, including viruses, that can' be isolated from a source in nature and 
25 which has not been intentionally modified by man in the laboratory is naturally 
oexurring. 

"Operably linked" refers to a juxtaposition wherein the components so 
described are in a relationship permitting them to function in their intended 
manner. A control sequence operably linked to a coding sequence is ligated in 
30 such a way feat expression of the coding sequence is achieved under conditions 
compatible with the control sequences. 
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"Control sequences" refer to polynucleotide sequences that effect the 
expression of coding and non-coding sequences to which they are ligated. The 
nature of such control sequences differs depending upon the host organism; in 
prokaryotes, such control sequences generally include promoter, ribosomal 
5 biding site, atid transcription, terminatic q in eukary. erall 

such control sequences include promoters and transcription termination 
sequences. The term control sequences is intended to include components whose 
presence can influence expression, and can also include additional components 
whose presence is advantageous, for example, leader sequences and fusion 
1 0 partner sequences, 

'Polynucleotide" refers to a polymeric form of nucleotides of a least ten 
bases in length, either ribonucleotides or deoxyribonacleotides or a modified 
from of either type of nucleotide. The term includes single and double stranded 
forms of DNA or RNA. 
1 5 "Genomic polynucleotide" refers to a portion of the genome. 

"Active genomic polynucleotide" or active portion of a genome" refer to 
regions of a genome that can be up regulated, down regulated or both, either 
directly or indirectly, by a biological process. 

"Directly" in the context of a biological process or processes, refers to 
20 direct causation of a process that does not require intermediate steps, usually 
caused by one molecule contacting or binding to another molecule (the same 
type or different type of m olecule). For example, molecule A contacts molecule 
B, which causes molecule B to exert effect X that is part of a biological process, 

"Indirectly" in the context of a biological process or processes, refers to 
25 indirect causation that requires intermediate steps, usually caused by two or more 
direct steps. For example, molecule A contacts molecule B to exert effect X 
which in torn causes effect Y, 

"Sequence identity" refers to the proportion of base matches between two 
nucleic acid sequences or the proportion of amino acid matches between two 
30 amino acid sequences. When sequence identity is expressed as a percentage, for 
example 50%, the percentage denotes the proportion of matches of the length of 
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sequences from a desired sequence that is compared to some other sequence. 
Gaps (in either of the two sequences) are permitted to maximize matching; gap 
lengths of 15 bases or less are usually used, 6 bases or less are preferred with 2 
bases or less more preferred. When using oligonuleorides as probes, fee 
5 sequence identity between the target nucleic acid and fee oligonucleotide 
sequence is preferably not less than 10 target base matches out of 20 (50% 
identity) and more preferably not less than about 60% identity; 70% identity, 
80% identity or 90% identity), and most preferably not less than 95% id entity, 
"Selectively hybridize" refers to deiectably and specifically bind. 

10 Polynucleotides, oligonucleotides and fragments thereof selectively hybridize to 
target nucleic acid strands, under hybridization and wash conditions that 
minimize appreciable amounts of detectable binding to nonspecific nucleic acids. 
High stringency conditions can be used to achieve selective hybridization 
conditions as known in the art. Generally, the nucleic acid sequence identity 

1 5 between the polynucleotides, oligonucleotides, and fragments thereof and a 
nucleic acid sequence of interest wilt he at least 30%, and more typically and 
preferably of at least 40%, 50%, 60%, 70%, 80% or 90%. 

Hybridization and. washing conditions are typically performed at high 
stringency according to conventional hybridization procedures. Positive clones 

20 are isolated and sequenced. For example, a full length polynucleotide sequence 
can be labeled and used as a hybridization probe to isolate genomic clones from 
an appropriate 'target library as they are known in the art. Typical hybridization 
conditions and methods for screening plaque Hits and other purposes are known 
in the ait (Benton and Davis, Science 1.96:180 (1978); Sambroofc et al, supra, 

25 (1989)), 

Two amino acid sequences have share identity if there is a partial or 
complete identity between their sequences. For example, 85% identity means 
that 85% of the amino acids are identical when the two sequences are aligned for 
maximum matching. Gaps (in either of the two sequences being matched) are 
30 allowed in .maximizing matching; gap lengths of 5 or less are preferred with 2 or 
less being more preferred. Alternatively and preferably, two protein sequences 
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(or polypeptide sequences derived from (hem of at it asi 3 1 amino acids in 

bntii as this term is used ! \ Ih have ignraent score 
of at least 5 (in standard deviation units) using the program .ALIGN with the 
mutation data matrix and a gap penalty of 6 or greater (Dayhoff, in Atlas of 
5 Protein Sequence and Structure, National Biomedical Research Foundation, 
volume 5, pp. 101-110 (1972) and Supplement 2, pp. MO). 

"Corresponds to" refers to a polynucleotide sequence that shares identity 
(for example is identical) to all or a portion of a reference polynucleotide 
sequence, or that a polypeptide sequence is identical to all or a portion of a 

10 reference polypeptide sequence, in contradistinction, the term "complementary 
to" is used herein to mean that the complementary sequence is homologous to all 
or a portion of a reference polynucleotide sequence. For illustration, the 
nucleotide sequence TAT AC corresponds to a reference sequence TAT AC and is 
complementary to a refer ence sequence GT ATA. 

1 5 The following terms are used to describe the sequence relationships 

between two or more polynucleotides: "reference sequence," "comparison 
window/' "sequence identity," "percentage of sequence identity," and 
"substantial identity." A reference sequence is a defined sequence used as a 
basis for a sequence comparison; a reference sequence can be a subset of a larger 

20 sequence, for example, as a segment of a full length cDNA or gene sequence 
given in a sequence listing, or may comprise a complete cDNA or gene 
sequence. Generally, a reference sequence is at least 20 nucleotides in length, 
frequently at least 25 nucleotides in length, and often at least 50 nucleotides in 
length. Since two polynucleotides can each (1) comprise a sequence (for 

25 example a portion of the complete polynucleotide sequence) that is similar 
between the two polynucleotides, and (2) may further comprise a sequence that 
is divergent between the two polynucleotides, sequence comparisons between 
two (or more) polynucleotides are typically performed by comparing sequences 
of the two polynucleotides over a "comparison window" to identify and compare 

30 local regions of sequence similarity, A comparison window, as used herein, 
refers to a conceptual segment of at least 20 contiguous nucleotide positions 
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wherein a polynucleotide sequence may be compared to a reference sequence of 
at least 20 contiguous nucleotide m 1 t I cm the portion of the polynucleotide 
sequence in the comparison window can comprise additions and deletions (for 
example, gaps) of 20 percent or less as compared to the reference sequence 
5 (which would not comprise addi li ons or deletion s) for optimal alignment of the 
two sequences. Optimal alignment of sequences for aligning a comparison 
window can be conducted by the local identity algorithm (Smith and Waterman, 
Adv, Appl Math.,. 2:482 (1981)), by the identity alignment algorithm 
(Needlemaa and Wunseh, J. Mol. Bio. s 48:443 (1970)), by the search for 

10 similarity method (Pearson and Lipman, Proc. NatL Acid. Sci. U.S. A, 85:2444 
(1.988)), by the computerized unplementations of these algorithms such as GAP, 
BESTF1T, FASTA and TFASTA (Wisconsin Genetics Software Page Release 
7.0, Genetics Computer Group, Madison, WI), or by inspection. Preferably, the 
best alignment (for example, the result having the highest percentage of identity 

15 over the comparison window) generated by tire various methods is selected. 

"Complete sequence identity" means mat two polynucleotide sequences 
are identical (for example, on a nucleotide-by-nucleotide basis) over the window 
of comparison. 

"Percentage of sequence identity" is calculated by comparing two 
20 optimally aligned sequences over the window of comparison, determining the 
number of positions at which the identical nucleic acid base occurs in both, 
sequences to yield the number of matched positions, dividing the number of 
matched positions by the total number of positions in the window of comparison 
(for example, the window size), and multiplying the result by 100 to yield the 
25 percentage of sequence identity. 

''Substantial identity" as used herein denotes a characteristic of a 
polynucleotide sequence, wherein the polynucleotide comprises a sequence that 
has at least 30 percent sequence identity, preferably at least 50 to 60 percent 
sequence, more usually at least 60 percent sequence identity as compared to a 
30 reference sequence over a comparison window of at least 20 nucleotide 

positions, frequently over a window of at least 25 to 50 nucleotides, wherein the 
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per itagi I sequence identity is calculated by comparing fee reference 
sequence to the polynucleotide sequence that may include deletions or addition 
which total 20 percent or less of the reference sequence over the window of 
comparison. 

5 "Substantial identity" as applied to polypeptides herein means that two 

peptide sequences, when optimally aligned, such as by fee programs GAP or 
BESTFIT using default gap weights, share at least 30 percent sequence identify, 
preferably at least 40 percent sequence identity, and mors preferably at least 50 
percent sequence identity, and most preferably at Jest 60 percent sequence 
1 0 identity. .Preferably, residue positions, which are not identical, differ by 
conservative amino acid substitutions. 

"Conservative amino acid substitutions" refer to the mterchangeability of 
residues having similar side chains. For example, a group of amino acids having 
aliphatic side chains is glycine, alanine, valine, leucine, and isoleucins; a group 
1 5 of amino ac ids having aliphatic-hydroxyi side chains is serine and threonine; a 
group of amino acids having amide-containing side chains is asparagine and 
gmtamine; a group of amino acids having aromatic side chains is phenylalanine, 
tyrosine and tryptophan; a group of amino acids having basic side chains is 
lysine, arginine and histidrne; and a group of amino acids having sulfur- 
20 containing side chan is cystein and methionine. Preferred conservative amino 
acid substitution groups are: valine-leucine-isolencine; phenylalanine-tjTosine; 
lysine-arginine; alanine-valine; glutamie-aspartic; and asparagine-glutamine. 

"Modulation" refers to the capacity to either enhance or inhibit a 
functional property of a biological activity or process, for example, enzyme 
25 activity or receptor binding. Such enhancement or inhibition may be contingent 
on the occurrence of a specific event, such as activation of a signal transduction 
pathway and/or may be manifest only in particular cell types. 

"Modulator" refers to a chemical {naturally occurring or non-naturaily 
occurring), such as a biological macromoleeule (for example, nucleic acid, 
. 30 protein, non-peptide or organic molecule) or an extract made from biological 
materials, such as prokaryotes, bacteria, eukaryotes, plants, fungi, multicellular 
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organi sms or animals, invertebrates, vertebrates, mammals and humans, 
including, where appropriate, extracts of: whole organisms or portions of 
organisms, cells, organs, tissues, fluids, whole cultures or portions of cultures, or 
environmental samples or portions thereof Modulators are typically evaluated 
5 for potential activity as inhibitors or activators (directly or indirectly) of a 
biological process or processes (for example, agonist, partial antagonist, partial 
agonist, antagonist, antineoplastic, cytotoxic, inhibitors of neoplastic 
transformation or cell proliferation, cell proliferation promoting agents, antiviral 
agents, antimicrobial agents, antibacterial agents, antibiotics, and the like) fay 

10 inclusion in assays described herein. The activi ty of a modulator may be known, 
unknown or partially known. 

"Test chemical" refers to a chemical or extract to be tested by at least one 
method of the present invention to be a putative modulator. A test chemical is 
usually not known to bind to tire target of interest. "Control test chemical" refers 

15 to a chemical known to bind to the target (for example, a known agonist, 

antagonist, partial agonist or inverse agonist). Test chemical does not typically 
include a chemical added to a mixture as a control condition that alters the 
function of the target to determine signal specificity in an. assay. Such control 
chemicals or conditions include chemicals that (l)non-specificaUy or 

20 substantially disrupt protein structure (for example denaturing agents such as 
urea or goandium, suifhydry! reagents such as dithiotritol and faeta- 
mercaptoethanol), (2) generally inhibit cell metabolism (for example 
mitochondrial uncouples) and (3) non-specifkally disrupt electrostatic ot 
hydrophobic interactions of a protein (tor example, high salt concentrations or 

25 detergents at concentrations sufficient to non-specifically disrupt hydrophobic or 
electrostatic interactions). The term test chemical also does not. typically include 
chemicals knows to be unsuitable for a therapeutic use for a particular indication 
due to toxicity of the subject. Usually, various predetermined concentrations of 
test chemicals ax e used for determining their activity. If the molecular- weight of 

30 a test chemical is known, the following ranges of concentrations can be used: 
between about 0.001 micromolar and about 10 miOimolar, preferably between 
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about 0.01 niicromolar and about 1 imllimolar, more preferab.1 , be( . \»en about 
04 microinolar and about 100 niicromolar. When extracts are uses a test 
chemicals, the concentration of test chemical used can be expressed on a weight 
to volume basis. Under these circumstances, the following ranges of 
5 concentrations can be used: between about 0.00 1 micrograms/ral and about 1 00 
ixulhgrara/nal, preferably between about 0.0 1 micro grams/ml and shout 1 0 
milligrams/ml, and more preferably between about 04. micrograms/ml and about 
1 milligrams/ml or between about 1 microgram/ml and about 100 
micrograms/ml. 

10 "Target" refers to a biochemical entity involved in a biological process, 

Targets are typically proteins that play a useful role in the physiology or biology 
of an organism, A therapeutic chemical typically binds to a target to alter or 
modulate its function. As used herein, targets can include, but not be limited to, 
cell surface receptors, G-proteins, G-protein coupled receptors, kinases, 

1.5 phosphatases, ion channels, lipases, phosbolipases, nuclear receptors, 
mtracellulai tbules, tubulin, and the like. 

"Label" or "labeled" refers to incorporation of a detectable marker, for 
example by incorporation of aradiolabled compound or attachment to a 
polypeptide of moieties such as biotin that can be detected by the binding of a 

20 section moiety, such as marked avidin> Various methods of labeling 
polypeptide, nucleic acids, carbohydrates, and other biological or organic 
molecules are known in the art. Such labels can have a variety of readouts, such 
as radioactivity, fluorescence, color, chemilumisescence or other readouts 
known in the art or later developed. The readouts can be based on enzymatic 

25 actuJ. 1 *sb€ lac! lase, b lactamase, horseradish peroxidase, 
alkaline phosphatase, iuciferase; radioisotopes such as 3 H, 14 C, 3 % n$ l or ,3! 1); 
fluorescent proteins, such as green fluorescent proteins; or other fluorescent 
labels, such as FITC, rhodamine, and laathanid.es. Where appropriate, these 
labels can be the product of the expression of reporter genes, as that term is 

30 understood in the art. Examples of reporter genes are beta-laetamase (U.S. 

Patent No. 5,741,657 to Tsien et al, issued April 21 , 1998) and green fluorescent 
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protein. (U.S. Patent No. 5,777,079 to Tsien et al. } issued My 7, 1998; U.S. 
Patent Ho. 5,804,387 to Corrnack et ah, issued September 8, 1998). 

"Substantially pure" refers to an object species or activity that is the 
predominant species or activity present (for example on a molar basis it is more 
3 abundant than any other individual species or activities in the composition) and 
preferably a substantially purified fraction is a composition wherein the object 
species or activity comprises at least about 50 percent (on a molar, weight or 
activity basis) of all macromolecules or activities present. Generally , as 
substantially pure composition will comprise more than about 80 percent of all 

10 macrornoleeuiar species or activities present in a composition, more preferably 
more than about 85%, 90%, 95% and 99%. Most preferably, the object species 
or activity is purified to essential homogeneity, wherein contaminant species or 
activities cannot be detected by conventional detection methods) wherein the 
composition consists essentially of a single macrornoleeuiar species or activity. 

15 The inventors recognize that an activity may be caused, directly or indirectly, by 
a single species or a plurality of species within a composition, particularly with 
extracts. 

A "ecteinascidin synthesis activity' is at least one activity involved in. the 
synthesis of an ecteinascidin. Examples of ecteinascidin synthesis activity can 
20 include without limitation a non-ribosomal peptide synthetase activity, an O- 
niethy! transferase activity, or a transaminase activity. 

"Pharmaceutical agent or drug" refers to a chemical, composition or 
activity capable of inducing a desired therapeutic effect when property 
administered by an appropriate dose., regime, route of administration, time and 
25 dehvery modality. 

"Pharmaceutical agent or drug" refers to a chemical composition or 
activity capable of inducing a de-sired therapeutic effect when property 
administered by an appropriate dose, regime, route of adminisnarion, time and 
delivery modality. 

30 A "hioactive compound" refers to a compound, that exhibits at least one 

bioacdvity. A hioactive compound may be an organic molecule or inorganic 
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molecule, protein, lipid, carbojiydrate, or nucleic acid, or any combination 
thereof, including complexes of any of these compounds. 

A "bioaetivity" refers to at least one activity that modulates a biological 
process. Preferred bioactivities include, but are not limited to: antitumor 
5 activity, anticancer cell activity, cytotoxic activity, antibacterial activity, 
antimicrobial activity, not. being substantially susceptible to multi-drug 
resistance, antiviral activity, immunomodulatory activity, anti-inflammatory 
activity, and radiation protective activity. 

"An ecteinascidin" refers to a compound that includes a 
10 tetrahydroisoquinoline alkaloid derived from the marine tunicate Ecteinascidia 
lurbinata and has at least one bioaetivity. 

"Made at least in part" or "at least in part synthesized" means that an one 
or more entities participate in one or more steps in the synthesis of the compound 
of interest or precursor of a compound of interest but may not be the sole entity 
1 5 or entities that participate in the synthesis of the compound or precursor of 
interest. When a single step in the synthesis of a compound of interest or 
precursor of a compound of interest is performed by one or more entities, "made 
at least in part" can refer to the participation of one or more entities in a synthetic 
steps or process that also requires the participation of at least one other entity. 
20 A "bioactive derivative" refers to a modification of a bioactive compound 

or bioaetivity that retains at least one characteristic activity of the parent 
compound, 

A ''bioactive precursor" refers to a precursor of a bioactive compound or 
bioaetivity that exhibits at least one characteristic activity of the resulting 
25 bioactive compound or bioaetivity. 

An "antibacterial activity" refers to an activity that reduces the growth 
rate or numbers of living in a sample, such as a culture of bacteria or a sample 
that includes at least one bacteria, including a patient. Such antibacterial activity 
can be directed against Gram-negative or Gram-positive bacteria and can be 
3 0 screened for or confirmed using methods known in the art 
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All "antmucrobial activity* 4 refers to an activity that reduces the growth 
rate or numbers of living microbes in a sample (including prokaryotie aid/or 
eukaryotic microbes), such as a culture of microbes or a sample that includes at 
least one microbe, including a patient, and can be screened for or confirmed 
5 using methods known in the art. 

An "antiviral activity" refers to an activity that reduces the infecfi\% of 
virus particles in a sample, such as in a sample including at least one vims, 
including a patient Such antiviral activity can bs directed against, from example, 
DNA or KM A containing viruses, including, but not limited to, herpesvirus, 
1 0 hepatitis virus and retrovirus. Such activity can be screened for using methods 
known, in the art. 

An "antitumor activity* refers to an activity that reduces fee growth rate 
or number of tamor cells in a sample, such as a culture of tumor cells or a sample 
that includes ai least one tumor cell, including a patient Such antitumor activity 
15 can be directed against any type of tumor or tumor cell including, but not limited 
to renal tumor, lung tumor, colon tumor, central nervous system tumor, 
melanoma, ovarian hmior and breast tumor. 

An "anticancer cell activity' refers to an activity that reduces the growth 
rate or number of cancer cells in a sample, such as a culture of cancer cells or a 
20 sample that includes at least one cancer cell, including a patient. Such anticancer 
cell activity can be directed against any type of cancer cell, including, but not 
limited to renal cancer, leukemia, lung cancer, colon cancer, central nervous 
system cancer, melanoma, ovarian cancer and breast cancer. 

A "cytotoxic activity" refers to an activity that reduces the number of 
25 viable cells in a sample, including prokaryotie ceils, eukaryotic cells or both. 
Such activity can be screened using methods known in the art. 

A "patient" or "subject" refers a whole organism in need, of treatment, 
such as a farm animal, companion animal or human. An animal refers to any 
non-human animal. 

30 Other technical terms used herein have their ordinary meaning in the art 

that they are used, as exemplified by a variety of technical dictionaries, such as 
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the McGraw-Hi ll Dictionary of Chemical Terms and the Stedman' s Medical 
Dictionary. 

The hi I >gic i 2 tdeci It gical functions of the ectemascidins are 
unknown. Some of the roles generally ascribed to secondary metabolites include 
5 deterrence against probation, inliibition of the settlement of fouling organisms, 
protection from bacterial or viral infections, and prevention of encroachment by 
adjacent organisms. 

Localization studies of compounds in sponges or ascidians have 
depended upon the presence of unique chemical structural features, such as 

1 0 heavy elements or pH dependent fluorescence, high bioactivity, and/or the ability 
to separate morphologically distinct bacterial and eukaryotic cells (Thompson et 
at, .1983; Bewlcy et ai. s 1996; Salomon et al„ 2001; and Unson et al. } 1994). 
However, many natural products do not have distinctive structural 
characteristics, and may not have high activity in bioassays. In addition, some 

15 invertebrates are not amenable to tissue dissociation and cell enrichment, and 
many bacteria do not have unique morphologies. Invertebrates containing 
interesting natural products are particularly difficult to work with because many 
of them cannot be cultured easily in a laboratory and often m ust be collected and 
preserved in remote, tropical areas. Additionally, sponge and ascidian systems 

20 are' complicated by the fact that the adults filter seawaier for food and generally 
contain large quantities of dietary-derived bacteria within their mesophyl matrix 
and gut, respectively, in order to study the potential role of symbionts in the 
biosynthesis of specific compounds in such complicated systems, it is necessary 
to employ a combination of chemical, biological and genetic techniques. 

25 One method of studying symbionts in typically complex 

mvertebmte/microbial associations is to focus on the chemistry and bacterial 
populations of non-feeding larvae. Many invertebrate larvae swim for a short 
period of time (minutes to days) and often do not filter feed until they 
permanently settle onto a surface and metamorphose into an adult (Strathmann, 

30 1985). Many invertebrates associated with obligate symbionts are known to 
transmit bacteria vertically to their offspring; therefore, any bacteria that are 
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consistently and specifically associated with the larvae are presumed to be true 
symbionts (Kraeger et al, 1996; Benayahu and Schleyer, 1998; Hirose, 2000). 
This approach was used to implicate a bacterial symbiotit as a source of the 
potential anti-cancer agent ryi i- nihebry >an Bngirf / 1 The; 
5 bryozoan larvae were found to contain both the bryostatins as well as a 
monoculture of symbiotic proteobacteria identified as Endobugula sertula 
candidatus (Haygood and Davidson, 1997; and Davidson and Haygood. 1999). 

The present, invention recognizes that organisms such as bacteria contain 
nucleic acid molecules that encode polypeptides that catalyze the synthesis of 
1 0 hioactive compounds, such as eeteinascidins. 

As a non-limiting introduction to the breath of the present invention, the 
present invention includes several general and useful aspects, including: • 



1) a bacteria that at least in part synthesizes at least one 
ecteinascidin; 

2) a preparation of bacteria from an Ecteinascidia, such as 
Ecteinascidia turbinate, that includes at least one activity 
involved in the synthesis of an ecteinascidin; 

3) a composition including at least one nucleic acid molecule that 
encodes at least one polypeptide that catalyzes at least one step in 
the synthesis of at least one ecteinascidin; 

4) a composition including a library of nucleic acid molecules of the 
present raveation* where one or more of these nucleic acid 
molecules can be used in a combinatorial biosynthesis of 



a composition including at least one polypeptide that catalyzes at 
least one step in the synthesis of at least one ecteinascidin, where 
preferably at least one polypeptide is derived from at least one 
marine organism; 

a composition including a library of polypeptides of the present 
invention; 
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?) an antibody that binds with , with a bacteria of 

the present invention; 

8) an antibody that binds with or specifically binds with a 
polypeptide of the present invention; 

9) a method of making an octeinascidin using a bacteri a of the 
present invention; 

10) a composition made using a bacteria of the present invention; 

11) a method of making a composition, including providing at least 
one nucleic acid of die present invention and synthesizing at least 



12) a composition made using a nucleic acid of the present invention; 

1 3) a method of making a composition including providing at least 
one polypeptide of the present invention and synthesizing at least 
one ecteinascidin; 

1 4) a composition made using a polypepti de of the present invention; 

15) a method for klentifying at least one nucleic acid molecule 
encoding at least one activity of in volved in the synthesis of an 
ecteinascidin, comprising contacting a nucleic acid molecule of 
the present invention with a sample, and identifying nucleic acid 
molecules in said sample that hybridize with a nucleic acid of the 
present invention; 

1 6) a nucleic acid molecule identified by a method of the present 
invention; 

17) a composition comprising a library of nucleic acid molecules 
identified using a method of the present invention; 

1 8) a method for identifying a bioaetive compound including 
contacting a compound made or identified using a bacteria of the 
present invention and det ermining the bioactivity of the 
compound; 

1 9) a compound identified by this method; 
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20) a method for identifying a bioactive compoand including 
contacting a compound made or identified using a nucleic acid 
molecule of the present invention and determining the bioactivity 
of the compound; 

21) a compound identified by this method; 

22) a method for identifying a bioactive compound including 
contacting a compound made or identified using a polypeptide of 
the present invention and determining the bioactivity of said 
compound; and 

23) a compound identified by this method. 



These aspects of the invention, as well as others described herein, can be 
achieved by using the methods, articles of manufacture and compositions of 
matter described herein. To gain a full appreciation of the scope of the present 
1 5 invention, it will be further recognized that various aspects of the present 
invention can he combined to make desirable embodiments of the invention. 



L Bacteria That Synthesize An Eeteiaascidm At Least In Part 

The bacteria maybe a member of any bacterial species, such as species 
20 belonging to the eubacteria or arehaebacteria, that synthesize an ectemascidm. at 
least in part. Preferably, the bacteria are marine bacteria, such as symbionts of 
the tunicate Ectemascidm turbinate The bacteria are isolated from their natural 
growth environment such that they are cultured in the absence of their host 
organism. 

25 Isolation, purification, and cidraring of bacteria can be accomplished 

using standard microbiological techniques as they are known, in the ait Sterile 
technique can be employed to prevent the introduction of exogenous bacteria of 
fungi into the preparation. The bacteria of interest may be isolated from E. 
turbinate by disrupting or homogenizing a preparation of E. turbinate. 

30 Disruption or homogenization maybe done by crashing, grinding, slicing, or 
rupturing the B, turbinate. This maybe done manually, with or without 
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implements such as honiogenizers, razor blades, scalpels, mortar and pestle, and 
tfce like, or may be done with, any of a variety of machines such as, but not 
limited to, a blender or a motorized hornogenizer. 

The disruptate or homogenate maybe filtered to remove large particles or 
S debris from the preparation, in tins ease the filter has a pore size large enough to 
allow bacteria to pass through. CenMfugatkm may be used to sediment the 
bacteria. Preferably this is done after removing large particles from the disruptate 
or homogeriate. After eentrifugation, die supernatant is removed and the pellet 
containing the bacteria may be resuspended in a buffer or media. 

1 0 The buffer or media to be used in resuspending the bacteria can be any 

media, but preferably is a media that promotes the growth of the bacteria, The 
media optionally contains a carbon source, a nitrogen source, salts, extracts such 
as yeast, beef, or liver extract, and'br vitamins. It may be desirable to include 
ex tracts of the E, turbinate host organism in the media. Such extracts may be 

1 5 taken from the disruptate or homogenate supernatant, or may be fractions of an 
E. titrblnata disruptate or homogenate obtained by one or more centrimgations, 
column chromatography steps, precipitations, dialyses, or other fractionation or 
purification steps. Alternatively, samples ofi?. turbinata can be processed by 
sterilization, such as by autoclavmg, and the resulting preparation used in 

20 microbiological medium by the optional inclusion of a gelling agent, such as 
pectin, agar or gelatin. Other nutrients can be added to such medium, such as 
vitamins, minerals and carbon and/or nitrogen sources. Optionally, an ocean 
environment can be created by adding autoclaved or filter sea-water, or artificial 
sea water (such as Instant Ocean) to such media. Substances that inhibit the 

25 growfli of contaminants can be included in the medium. Such substances can be 
known antibiotics, or can be other substances found to irmibit the growth of 
contaminant and can be eetemascidins. 

Ctdtures may be obtained by plating the bacteria on solid or semi-solid 
media or by propagating in liquid media and incubating the bacteria at a 

30 temperature, pressure and atmospheric conditions that allows the bacteria to 
propagate. Preferably, such culture conditions mirror or are similar to those that 
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the mieroorgarasm encounters in its native environment Pure cultures may be 
obtained by a variety of art recognized methods, such as sequent! - ! 
colonies of bacteria from the culture, and/or by sequential dilution of bacterial 
cultures. Cultures enriched for a particular nueroorganism can be obtained by 
5 culruring a sample m a selective or differential microbiological medium such 
that the relative proportion of a particular microorganism is increased relative to 
other microorganisms. Such enrichment procedures can be used restoratively to 
provide for enriched cultures of varying purity. 

The bacteria may be cultured cm solid or semi-solid media, but once 

1 0 purified, the bacteria are preferably cultured in liquid media. When, cultured in 
liquid media, the cultures may bs shaken for aeration, or air may be injected into 
the culture through a tube to introduce oxygen into the culture, if such bacteria 
are facultatively aerobic or aerobic, in metabolism. Preferably, the media used to 
culture such microorganisms minors or is related to tire natural en vironment of 

1 5 the microorganism. 

Preferably fee bacteria are grown to optimal density for the production of 
ecteinascidins or ecteinascidin precursors, and the bacterial culture is harvested 
and centrifuged to sediment the bacteria. If the culture media is to be used for 
extraction of one or more eeteinascidms or ectemascidin precursors, the 

20 supernatant is removed from the bacterial pellet. Purification of one or more 
ectemascidfus or ecteinascidin precursors may be performed through any 
methods know or developed in the future in the art, including combinations of 
any of the following: dialysis, precipitations, solvent partition, and 
chromau _i b.; Deluding high speed countercurrent chromatography 

25 (HSCCC), nonnai, and reverse phase chromatography. Other methods for 
purifying organic molecules are known in the art and may be employed as 
appropriate. 

Activities of ecteinascidins may be assayed by any of the following art- 
recognized methods; An antitumor activity may be assayed in vivo using 
30 xenografts of human tumors In nude mice (see, for example, Veloti et al, 1977- 
1983 and U.S. Patent Numbers 5,256,663 and 5,089,273, herein incorporated by 
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reference) or ex vivo (Hakala, et at, 1 987}. An anti-cancer ceil activity may be 
assayed using cell lines such as the P-388 murine leukemia ceil Hue, the HT-2.9 
human colon cancer cell line, and the A-549 human lung adenocarcinoma cell 
line (see, for example, U.S. Patent 5,478,932. and 5,089,273 hereby incorporated. 
5 by reference and references disclosed therein.) A. cytotoxic activity can be 
assayed by using the LI 210 ranrine len&emia cells or the CV-I monkey kidney 
cell line as disclosed in U.S. Patents 5 f 478,932 and 5,256,663, both hereby 
incorporated by reference. Assays to determine antibacterial and antiviral 
activities are also well known in the art and may be done as disclosed in U.S. 
1 0 Patent numbers 5,089.273 and 5,484,717, and references disclosed therein. 

IL Nucleic Acid Molecules That Encode Polypeptides That Catalyze the 
Synthesis of An Ecteinascidin At Least la Part and Libraries of Sueb 
Nucleic Acid Molecules 

1 5 The present invention includes a composition including at least one 

nucleic acid molecule, such as a substantially purified or purified nucleic acid 
molecule, that encodes at least a portion of at least one polypeptide that catalyzes 
at least one step in the synthesis of at least, one ecteinascidin. Preferably, at least 
one nucleic acid molecule is derived from at least one marine organism. The 

20 nucleic acid molecules of the present invention can comprise the nucleic acid 
molecules disclosed herein, including PGR primers, portions thereof, and nucleic 
acid molecules that selectively hybridize with or have substantial identity with 
the nucleic acid molecules disclosed herein or portions thereof or encode at least 
one conservative amino acid substitution relative to the disclosed sequences or 

25 portions thereof. A nucleic acid molecule of the present invention can be DMA 
JRNA, single stranded, double stranded or any combination thereof, 

A nucleic acid molecule of the present invention preferably encodes at 
least a portion of a polypeptide involved in the synthesis of at least one 
ecteinascidin. Preferably, the polypeptide is at least a portion of a protein or 

30 complex that has non-ribosomal peptide transferase activity, that has O-rnethyl 
■transferase activity, that has transaminase activity or that participates in the 
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synthesis or modification of one or more ectemascidins. A nucleic acid molecule 
of tile present invention preferably encodes at least a portion of at least one such 
active domain and can include at least one activity of such an active domain, 
preferably an activity hat yzes at 1 i < in step in the synthesis of an 
5 ecteinasctdsn. Preferably, a nucleic acid molecule of the present invention is 
between about 10 bases or base pairs and about 500 Kb, between about 100 
bases or base pairs and about. 250 Kb or between about 500 bases or base pairs 
and about 100 Kb in length. 

A nucleic acid molecule of the present invention can be derived from at 

10 least one marine organism. A marine organism can include any organism that 
can be found in a marine environment, either naturally or xenotypically. A 
marine organism can be a vertebrate, an invertebrate or a unicellular organism, 
such as a fungi, algae or bacteria. Preferably, a marine organism, is an 
invertebrate, such as an Ecteinascidia^ such as Ecteinascidia turbinate* or a 

15 unicellular organism, such as a bacteria, such as a bacteria associated with an 
Ecteinascidia, such as Ecteinascidia turbinate. 

A nucleic acid of the present invention can also encode a fusion protein 
that includes a polypeptide of the present invention and a polypeptide of interest. 
A polypeptide of interest can be any polypeptide, but is preferably a detectable 

20 label, such as green fluorescent protein, or a sequence that aids in the purification 
of a polypeptide, such as FLAG, A nucleic acid that encodes a fusion protein 
can be made by operably linking a nucleic acid that encodes a polypeptide of 
interest with a nucleic acid that encodes a polypeptide of the present invention. 
The operable linking can be direct or indirect, such as in the case where a linker 

25 connects the polypeptide of the present in vention with a polypeptide of interest. 
The nucleic acid molecule of the present invention and the nucleic acid that 
encodes a polypeptide of interest are preferably operably linked in frame such 
that an operable polypeptide of the present invention and an operable 
polypeptide of interest are translated from the nucleic acid, but that need not be 

30 the case. 
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Nucleic acid molecules of the present invention can be made using 
methods blown in the art and described herein (see, Sambrook et ai, supra 
(1989); Ausubei et at, supra (1998)). For example, nucleic acid molecules of 
the present invention can be identified and isolated using PGR methodologies, 
5 including RT-PCR, and sequenced using established methods such that their • 
homologies can be determined. The ability of one nucleic acid molecule to 
hybridize with another can be determined through experimentation under a 
vari ety of stringencies, or can be estimated based on their length and G:C 
contents. Alterations of identified sequences can be made using routine 

1 0 methods, such as mutagenesis, RT-PCR or other PCR methods (See, Sambrook 
et aL, 1989; Ausubei et al, 1998). 

A nucleic acid molecule of the present invention can include at least one 
expression control sequence. Preferably, an expression control sequence is 
operably linked to a nucleic acid molecule such that the nucleic acid molecule 

1 5 can be expressed in an hi vivo or in vitro transcription and/or translation system. 
The choice of c qa sssioi a >ntrol sequences is dependent upon the transcription 
system to be used. For example, if a prokaryotic organism such as E. coli is to 
be used to express a nucleic acid molecule, then at least one appropriate 
prokaryotic expression control sequence would be used. Likewise, if a 

20 eukaryotic organism is to be used to express a nucleic acid molecule, then at 
least one appropriate eukaryotic expression control sequence, such as the CMV 
promoter or LTRs would be used. Such nucleic acid molecules can be in any 
form, such as in a piasmid or in a linear form. 

A nucleic acid molecule of die present invention can be provided with or 

25 without expression control sequences in a vector, such as a piasmid or a viral 
vector. Viral vectors can be chosen so that they are appropriate for a cell to be 
transfected, such as, for example, a phage, cosmid, retrovirus, vaccinia virus, 
adenovirus or adeno-associated vims. Viral vectors can introduce a nucleic acid 
molecule into a cell during its normal biological processes. Non-viral vectors 

30 can be used to introduce a nucleic acid molecule of the present invention into a 
host cell using methods known in the art. such as lipofection, cold calcium 
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< hi -rue treatment or electroporation. The nucleic acid molecule in a cell can be 
extracbromosomal or be integrated into the genome of the cel l. The host cell can 
be any appropriate host cell, such as a eukaryohe or prokaryotLc cell Preferably, 
a nucleic acid molecule of the present invention is expressed in the cell, but that 
5 is not a requirement of the invention. Preferably, the cell does not normally 
include a nucleic acid molecule of the present invention or express a polypeptide 
of the present invention, but that need not be the ease. For example, a cell thai 
expresses a relatively low amount of a polypeptide of the present in vention can 
be made to express relatively higher amounts of a polypeptide once transfected 

1 0 with a nucleic acid of the present invention. 

Cells that express a polypeptide of the present invention can be scr eened 
for and selected using a variety of methods, including those set forth in the 
present invention. For example, immunoassays, such as western blots, can be 
used to identify cell lysates that inelade a polypeptide of the present invention. 

15 In addition, ixmmuiocytochsmistry can be used can be used to identify and 

localize a polypeptide of die present invention on or within a cell. Furthermore, 
in situ hybridization methods, such as FISH, can be used to identify and localize 
nucleic acid molecules within a cell and hybridization methods can be used to 
identify nucleic acid molecules, either DNA or RNA for cellular preparations. 

20 Cells or cell lysates can be screened for an activity using a v ariety of methods. 
For example, the ability of a cell or cell lysate to bind with a substrate or convert 
a substrate, including a def ectably labeled substrate, can be used to detect a 
particular activity (see Haygood and Davidson, 1997). 

Nucleic, acid molecules of the present invention can encode peptides that 

25 have non-ribosom al peptide synthetase activity, have O-methyl transferase 
activity, have transaminase activity, or that participate in the synthesis or 
modification of one or more eciemaseidms. Nucleic acid molecules encoding 
peptides having such activity can he identified by making comparisons of nucleic 
acid sequence or translation amino acid sequences derived therefrom using 

30 methods known in the art, including BLAST comparisons. A nucleic acid 

molecule of the present invention can be expressed and the expression products 
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screwed and confinned for having such activity. In addition, nucleic acid 
molecules of the present invention can encode polypeptides that have other 
activities. Methods for screening such activities are known in the art 

The present invention also includes a library of nucleic acids of the 
5 present invention, A library of nucleic acids includes between about two, about 
four, about six, about eight, about tea, about thirty, about seventy, about one- 
hundred, about one-thousand, about ten-thousand, about one-hundred thousand 
or about ones-million nucleic acid molecules and about three, about five, about 
seven, about twenty, about fitly, about five-hundred, about fifty-thousand, about 

1 0 five-hundred thousand and about ten million nucleic acid molecules. The 
members of such a library are preferably different nucleic acid molecules, but 
that need not be the case. 

The nucleic acid molecules of the present invention can be used for a 
variety of applications, and therefore include bin are not limited to. PCR primers, 

15 probes to identify similar sequences, and constructs for in vitro or in vivo 

transcription and/ or translation to make polypeptides of the present invention. 
The particular application of a nucleic acid molecule depends on a variety of 
factors, such as they are known in the art, include the length, strandedness (single 
stranded or double stranded and positive sense or negative sense), chemical 

20 characterization (such as DNA or RNA) or whether the nucleic acid molecule is 
detectably labeled or not 

III. Polypeptides That Catalyzes the Synthesis of An Ecteinascidin At 
Least in Part and Libraries Thereof 

25 The present invention also includes a composition including at least one 

polypeptide or a portion thereof that catalyzes at least one step in the synthesis of 
at least one ecteinas idin, v her in the at least one polypeptide or a portion 
thereof is preferably derived from at least one marine or ganism. 

A polypeptide of the present invention can be derived from at least one 

30 marine organism. A marine organism can include any organism that can be 
found in a marine environment, either naturally or xenotypicaliy. A marine 



30 



PCT/US02/03703 



ism can be a vertebrate, an in\ r eftebrate or a unicellular organism, such as a 
fungi, algae or bacteria. Preferably, a marine organism is an invertebrate, such as 
an Eeteinascidia such as Ecteinasi i . ta 3 or a unicellular organism, such 

as a bacteria, such as a bacteria of the present invention. 
5 The nucleic acid molecules of the presen t invention can be translated to 

pro vide polypeptides, These polypeptides can be substantially purified or 
purified and preferably have at least one activity of a polypeptide that 
participates in the synthesis of an ectekaseidin, such as but not limited to, a non- 
ribosomal peptide synthetase (NRPS), an O-methyl transferase, a transaminase, 
10 or any polypeptide that participates in the synthesis or modification of an 

ecteinaseidim Assays for non-ribosomal peptide synthetases are known in the art 
(see for example, Marahiel et ah, 1.997), and references disclosed therein). The 
activity of the polypeptide can be screened and confirmed using methods known 
in the art, later developed or described herein. For example, antibodies that bind 

15 with active portions or fragments of an activity can be used to identify 
appropriate polypeptides. Alternatively, substrates for an activity, such as 
substrates that are detectably labeled, can be used to detect the binding of a 
substrate to an ac t i vity or the conversion of a substrate to a product. 

A polypeptide of the present invention can be of any length, hut is 

20 preferably between about 10 amino acids and about 300,000 ammo acids in 
length and more preferably between about 100 amino acids and about 100,000 
amino acids in length or between about 1,000 amino acids and about 30,000 
amino acids in length. 

The polypeptide of the present invention can be made using recognized. 

25 methods, such as by way of recombinant methods as they are known in the art 
(see, Sambrook et al, 19S9; Ausubel et ah, 1998) or by digesting proteins or 
polypeptides. For example, nucleic acid molecules encoding or suspected of 
encoding a polyp epti de of the present Invention can be cloned into expression 
vectors that are transformed into appropriate host cells whore the nucleic acid 

30 molecules are expressed. The resulting polypeptides can be optionally purified 
and their acti vity confirmed using methods of the present invention or as they are 
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known in the art or later developed. Alternatively, the in vivo activity of 
polypeptides can be confirmed using methods of the present invention or as they 
are known in the art 

A polypeptide of the present invention can be provided ex- vivo or within 
5 a ceil. A polypeptide of the present invention can he expressed within a cell by 
transfecting a ceil with a nuc leic acid molecule that encodes a polypeptide of the 
present invention. The nucleic acid molecule of the present invention can be 
operahly linked to expression control sequences appropriate for the cell such that 
the nucleic acid molecule of the present invention is expressed on or within the 

1.0 cell. The nucleic acid molecule can also encode a fusion protein such that the 
fusion protein is expressed on or within the cell In this instance, a fusion 
protein that includes a detectable label as die polypeptide of interest can be used 
to confirm the presence or track the location of the fusion protein in the cell. 

A polypeptide of the present invention can also be part of a fusion protein 

1 5 that includes a polypeptide of the present invention and a polypeptide of interest. 
A polypeptide of interest can be any polypeptide, but is preferably a detectable 
label, such as green fluorescent protein, or a sequence that aids in the purification, 
of a polypeptide, such as FLAG. A fusion protein that includes a polypeptide of 
the present invention can be made from a nucleic acid that encodes a fusion 

20 protein can be made by operabty linking a nucleic acid that encodes a 

polypeptide of interest with a nucleic acid that encodes a polypeptide of the 
present invention. The operable linking can be direct or indirect, such as in the 
case where a linker connects the polypeptide of the present invention with a 
polypeptide of interest. The nucleic acid molecule of the present invention and 

25 the nucleic acid that encodes a polypeptide of interest are preferably operably 
linked in frame such that an operable polypeptide of the present invention and an 
operable polypeptide of interest are translated from the nucleic acid, but that 
need not be the case. The present invention also includes such fusion proteins or 
libraries of such fusion proteins. 

30 The present invention also includes a library of polypeptides of the 

present invention. A library of polypeptides of the present invention, including 



3.2 



WO 02/S64W3 



P< T/i S02/03703 



fusion proteins, includes between about two, about four, about six, about eight, 
about ten, about thirty, abotit seventy, about one-hundred, about one-thousand, 
about ten-thousand, about one-hundred thousand or about one-million 
polypeptides and about three, about five, about seven, about twenty, about fifty, 
5 about five-hundred, about fifty-thousand, about five-hundred thousand and about 
ten million polypeptides. The members of such a library are preferably different 
polypeptides, but that need not be the ease. 

IV. Antibodies that Bind with a Bacteria of the Present Invention or a 
1 0 Polypeptide of the Present Invention 

The present invention also includes antibodies that bind with or 
specifically bind with a bacterial or polypeptide of the present invention. The 
present invention also includes active fragments of such antibodies and methods 
of detecting bacteria of the present invention and polypeptides of the present 
1 5 invention. In such methods, the binding of an antibody to a target can be 

detected using direct or indirect detection methods, preferably using a detectable 
label, such as an enzyme, particle, or fluorescent moiety such as green 
fluorescent protein. 

The present invention also includes antibodies that bind with or 
specifically bind with a bacteria of the present invention. Such antibodies can. be 
polyclonal or monoclonal and can be made using methods known in the art (see, 
Barlowe and Lane, 1988). The antibodies can be whole antibodies of any class 

25 or subclass or mixture thereof, and can include active fragmen ts of such 

antibodies, such as Fab fragments made using methods known in the art. The 
specificity of such antibodies can be screened and confirmed using assay formats 
known in the art, such as enzyme linked immunosorbent assays (ELISAs) or 
other appropriate immunoassay formats. Such antibodies are useful in detecting 

30 the presence and/or amount of such bacteria in a sample. 
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The present invention also includes antibodies that bind with or 
specifically bind with a polypeptide of the present invention. Such antibodies 
can be polyclonal or monoclonal and can be made using methods known in the 
5 art (see, Hariowe and Lane, 1988). The antibodies can be whole antibodies of 
any class or subclass or mixture thereof, and can include active fragments of 
such antibodies, such as Fab fragments made using methods known in the art. 
The specificity of such antibodies can be screened and confirmed using assay 
formats known in the art, such as using enzyme linked immiinosorbent assays 
1 0 (ELISAs) or other appropriate immunoassay formats. Such antibodies are useful 
in delecting the presence and/or amount of such polypeptides in a sample. 

V. Method of Making Eetefaasddtas Using Bacteria, Nucleic Acids or 
Polypeptides of the Present Invention and Compositions Made Thereby 

15 The present invention also includes a method of making a composition 

including providing at least one bacteria, nucleic acid molecule or polypeptide of 
the present invention, and synthesizing at least one ecteinascidin or precursor 
thereof, 

20 Bacteria 

At least one species of bacteria of the present invention can be used to 
perform at least one step in the synthesis of an ecteinascidin. The ecteinascidin 
or ecteinascidin precursor or precursors synthesized by this method can be any 
previously known or unknown ecteinascidin, including novel ectemastidms 

25 synthesized by modification of the biosynthesis pa thway of a naturally occurring 
ecteinascidin, and including novel compounds synthesized using nucleic acids or 
polypeptides of the present invention that participate in ectemascidin synthesis. 

The bacteria may be a member of any bacterial species, such as species 
belonging to the enbacieria or archaebacteria, that synthesize an ecteinascidin, at 

30 least in part. Preferably, the bacteria are marine bacteria, such as syrnhionts of 
the tunicate Ectetnasctdia turbinate. The bacteria are isolated from their natural 
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growth environment such feat they are cultured in the absence of their host 
organism. Bacteria of fee present invention may or may not be transformed with 
nucleic acids from other organisms, such as other bacterial species. Polypeptides 
encoded by such nucleic acids may participate hi the synthesis of ecteixtascidins, 
5 including ectemascidins not naturally synthesized by fee cultured bacteria, 
including modified forms of naturally occurring ectemascidins. 

Alternatively, fee bacteria may be novel or previously identified bacteria 
feat do not naturally synthesize ectemascidins, but are transformed with nucleic 
acids of the present invention such feat expression of fee nucleic acids allow fee 

1 0 bacteri a synthesize one or more ectemascidins or one or more eetemascidin 
precursors , The bacteria may be of any bacterial species, such as species 
belonging to fee eubacteria or archaebacieria. In addition, novel or previously 
identified bacteria, such as Streptomyces lavendulae, Pseudomanas fluoresceins 
and Myxococcus xanthus, that synthesize compounds similar to ectemascidins, 

1 5 may be transformed wi fh nucleic acids of the present invention such, feat the 
bacteria produce novel compounds, 

Isolation, purification, and culturing of bacteria can he accomplished 
using standard microbiological techniques as they are known in fee art. Sterile 
technique can be employed to prevent the introduction of exogenous bacteri a of 

20 fungi into fee preparation. The bacteria of interest may be isolated from £ 
turbinate by disrupting or homogenizing a preparation of B> turbinata. 
Disruption or homogemzation may be done by crushing, grinding, slicing, or 
rapturing fee E. turbinata. This may be done manually, with or without 
implements such as homogenizers, razor blades, scalpels, mortar and pestle, and 

25 fee like, or may be done with any of a variety of machines such as, but not 
limited to, a blender or a motorized homogenizes 

The disruptate or homogenate may be filtered to remove large particles or 
debris from the preparation, hi this case the filter has a pore size large enough to 
allow bacteria to pass through. Centrimgation may be used to sediment the 

30 bacteria. Preferably tins is done after removing large particles fiom fee disruptate 
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or homogenaie. After cenirifugaikm, the mperaatant is removed and the pellet 
containing the bacteria may be resmpended in. a buffer or media. 

The buffer or media to be used in resuspending the bacteria can be any 
media, but preferably is a media mat promotes the growth of the bacteria. The 

5 media optionally contains a carbon source, a nitrogen source, salts, extracts such 
as yeast, beef, or liver extract, and/or vitamins. It may be desirable to include 
extracts of the E. tttrbtnata host organism in the media. Such extracts may be 
taken firora the disruptate or homogenate supernatant, or may be fractions of an 
R turbinate disruptate or homogenate obtained by one or more centrirugations, 

1 0 column chromatography steps, precipitations, dialyses, or other fractionation or 
purification steps. Alternatively, samples of E. turbmata can be processed by 
sterilisation, such as by aiiteclaving, and the resulting preparation used in 
microbiological medium by the optional inclusion of a gelling agent, such as 
pectin, agar or gelatin. Other nutrients can be added to such medium, such as 

1 5 vitamins, minerals and carbon and/or nitrogen sources . Optionally, an ocean 
environment can be created by adding autoclaved or filter sea-water, or artificial 
sea water (such as Instant Ocean) to such media. 

Cultures may be obtained by plating the bacteria on solid or semi-solid 
media or by propagating the bacteria in liquid media and incubating the bacteria 

20 at a temperature, pressure and atmospheric conditions that allows the bacteria to 
propagate. Such culture conditions may mirror or be similar to those that the 
microorganism encounters in its native environment. 

Pure cultures may be obtained by a variety of art recognized methods, 
such as sequentially streaking colonies of bacteria from the culture. Cultures 

25 enriched for a particular microorganism can be obtained by culturmg a sample in 
a selective or differential microbiological medium such that the relative 
proportion of a particular microorganism is increased relative to other 
microorganisms. Such enrichment procedures can be used restoratively to 
provide for enriched cultures of varying purity. 

30 The bacteria may be cultured on solid or semi-solid media or in liquid 

media, hut once purified, the bacteria are preferably cultured in liquid media. If 
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the medium is a solid or semi-solid medium, a gel] I g age such as pectin, agar 
or gelatin is included. The medium optionally includes a nitrogen source, salts, 
extracts such as yeast, beef, or liver extract, and/or vitamins. It may be desirable 
to include extracts ofthe£ turbinate host organism in the media. Such extracts 
5 may he taken from the disruptate or homogenate supernatant, or may be tractions 
of an E. turbinate disruptate or homogenate obtained by one or mote 
cenaifugations, column chromatography steps, precipitations, dialyscs, or other 
fractionation or purification steps. Alternatively, samples of M, turbinata can be 
processed by sterilization, such as by autoclaving, and the resulting preparation 

1 0 used in microbiological medium. Optionally, an ocean environment can be 
created by adding autoclaved or filter sea-water, or artificial sea water (such as 
Instant Ocean) to .such media. When cultured in liquid media, the cultures may 
be shaken for aeration, or air maybe injected into the culture through one or 
more tubes to introduce oxygen into the culture, if such bacteria are facultatively 

1 5 aerobic or aerobic in metabolism. Preferably, the media used to culture such 
microorganisms mirrors or is related to the natural environment of the 
microorganism. 

For production of ecteinascidins, bacteria maybe cultured under 
appropri ate conditions of temperature, salt, nutrients, pH, and aeration. The 

20 growth media preferably contains compounds thai promote optimal growth of 
the bacteria, such as but not limited to, salts, sugars, amino acids, vitamins, 
extracts, and/or buffering agents. The bacteria maybe cultured under 
fermentation conditions such that they may be grown to high density. In 
fermentative growth, it may be desirable to add compounds, such as sodium 

25 hydroxide, to stabilize the pH of the media as the cells grow. Alternatively, 
media may be exchanged during the period of culturing such that the growth 
media retains optimal levels of pH and. nutrients. 

Precursors of ecteinascidins, such as but not limited to amino acids, may 
be added to the bacterial media. Ecteinascidm precursors, and other compounds 

30 soeh as nutrients, may be added to the culture one or more times before and/or 
after inoculation of the media with bacteria. 
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Culture media may be harvested daring the period of growth for isolation 
of ecteinascidins. Alternatively, the bacteria, the media, or both may be 
harvested at the end of the growth period. 

Ecteinascidins or ecteinascidin precursors may be isolated and purified or 

5 partially purified from the media, bacteria, or both using appropriate methods as 
they are known or developed in the art, Such methods may include one or more 
steps of one or more of the following: centrifugation. phase separation, 
precipitation, chroma tography, dialysis. The purity of the eoteinascidins or 
ecteinascidin precursors may be evaluated by chemical or biophysical means, 

1 0 such as ID or 2D NMR and their activity assayed by antitumor, anticancer, 

cytotoxic, antibacterial, or antiviral assays as they are known, in tie art, including 
those disclosed in U.S, patents 5,256,663, 5,478,932,5,484,717, and 5,089,271 
The present invention also includes compounds made or identified using 
the present invention. For example, the present invention includes ecteinascidins 

1 5 made using at least one method of the present invention . A compound made or 
identified using a method of the present invention can be a novel or non-novel 
compound. For example, a compound of the present invention can include a 
compound that was not or was not novel on the date of the filing of the present 
application, or one year or six months prior to the filing date of the present 

20 application, 

A compound of the present invention can be provided with at least one 
phamiaoeutically acceptable carrier as they are known in the art and discussed 
herein. Such phannaceutically acceptable carriers are known in the art and are 
disclosed herein. A compound of the present invention can also he a 
25 pharmaceutical composition, 

Nucleic Acid M olecules and Polypeptides 

At least one nucleic acid molecule of the present invention or at least one 
. polypeptide of the present invention can be expressed and used in a system to 
30 synthesize an ectemascidin. The ectemascidin or precursors thereof can be 
previously known or unknown ecteinascidin or precursors thereof. 
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The present invention utilises at least one nucleic acid molecule of the 
present invention and/or at least one polypeptide of the present invention in such 
methods to make known or novel ectdnascidins or related compounds, including 
Et-743. The present invention can utilize at least one nucleic acid molecule of 
5 fee present invention and/or at least one polypeptide of the present invention 
alone or in combination with other genes. These polypeptides and genes can. be 
known or later developed and can be derived from marine, aquatic or terrestrial 
organisms. 

For example, some methods of synthesizing ecteinascidins may provide 

10 cassettes that include a gene complex that encodes at least one ecteinascidin 
synthesis activity, such as a non-ribosomal peptide synthetase activity, either in 
whole or in part. Nucleic acid molecules of the present invention can be inserted 
into such cassettes randomly or non- randomly, including replacing identified 
genes. When nucleic acid molecules of the present invention are inserted non- 

1 5 randomly into such cassettes, they can be inserted in-frame to replace existing 
genes that encode polypeptides that, have functions similar to the polypeptide 
encoded by a nucleic acid of the present invention. The nucleic acid molecules 
of the present invention are thus expressed as polypeptides of the present 
invention, which can act as part of a gene complex to produce known or novel 

20 compounds, such as ecteinascidins. 

Cells or extracts thereof (such as substantially purified extracts) that 
include one or more of the nucleic acid molecules of the present invention or one 
or more polypeptides of tire present invention can be used to synthesize a wide 
variety of ecteinascidins including Et-743 . Such cells or extracts thereof can be 

25 contacted with a variety of compounds, including substrates for an ecteinascidin 
synthesis activity, particularly an ecteinascidin synthesis activity present in the 
cells or extracts thereof. Polypeptides expressed from nucleic acids of the 
present invention can act on these compounds in order to make a wide variety of 
ecteinascidins including Et-743. In one aspect of the present invention, more 

30 than one cell and/or extract thereof can be used in combination or sequentially 
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such that fee products made by combination of cells or extracts can be 
determined and its activity confirmed. 

Alternatively, polypeptides expressed from nucleic acids of the present 
invention can be produced and purified or substantially purified for use in the 
5 manufacture of one or more ectemaseidins, one or more eeteinascidin precursors, 
or one or more ecteii iscidis J red compounds, including novel compounds. 
The polypeptides may be produced by expression of fee nucleic acids of the 
present invention in any suitable system, including in vitro transcription and 
translation systems and in vivo expression systems. For example, a. polypeptide 

1 0 of the present invention can be expressed within a cell, by transfecting a cell with 
a nucleic acid molecule that encodes a polypeptide of fee present invention. The 
nucleic acid molecule of fee present invention can be operably linked to 
expression control sequences appropriate, for the cell such that fee polypeptide of 
fee present invention is expressed on or within the cell, or the polypeptide of the 

15 present invention that is expressed within a cell is secreted from the cell Cells 
for expression of a polypeptide of tire present invention include but are not 
limited to bacterial ceils, such as E. coli or Bacillus cells, yeast cells, such as 
Saccharomyces cerevisiae, Pichia pasioris, and Pickia metholica, eiikaryotic 
cells, such as plant cells, insect cells, or mammalian cells including without 

20 limitation COS- 1 cells (ATCC No, CRL 1650), COS-7 cells (ATCC No. CRL 
1651), BHK cells (ATCC No. CRL1632), BHK 570 ceils (ATCC No. CRL 
1 0314), 293 cells (ATCC No. CRL 1573) and CHO-K1 cells (ATCC No. CCL 
61). Transformed or transfected cells can be cultured according to conventional 
procedures in a culture medium containing nutrients and other components 

25 required for the growth of the particular host cells, and preferably, for the 
selection of cells containing the exogenously added nucleic acid. 

A polypeptide- of the present invention may be purified from the cell 
culture using well-established methods of protein purification, preferably 
including affinity purification (see, for example, Scopes, 1 994; 1 988). 

30 The present invention also includes compounds made or identified using 

the present invention. For example, the present invention includes ectemascidins 
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made using ai least one method of tie present invention. A compound made or 
identified using a method of the present invention can be a novel or non-novel 
compound. For example, a compound of the present invention can. include a 
compound that was or was not novel on the date of the filing of fee present 
5 application, or one year or six months prior to the filing date of the present 
application, 

A compound of the present invention can be provided with at least one 
phannaceutically acceptable carrier as they are known in the art and discussed 
herein. Such pharmaceutically acceptable carriers are known in the art: and are 
1 0 disclosed herein- A compound of the present invention can also be a 
pharmaceutical composition. 

VI. Method of Identifying Nucleic Acid Molecules, Nucleic Acid 
Molecules Identified Thereby and Libraries of Nucleic Acid Molecules 

1 .5 The present invention also includes a method for identifying at least one 

nucleic acid molecule encoding at least one ectemascidin synthesis activity 
including contacting a nucleic acid molecule of the present invention with a 
sample, and identifying nucleic acid molecules in said sample that hybridize with 
said nucleic acid molecule of the present invention. This aspect of the present 

20 invention utilizes nucleic acid molecules of the present, invention as probes or 
PGR. primers in order to identify nucleic acid molecules that have or are expected 
to encode polypeptides that have ecteinascidtn synthesis activity. 

Samples for use in die present invention can be from any source that can 
include a nucleic acid molecule, but preferably include samples from an 

25 environmental sample, such as the marine environment. The samples can 
include marine organisms, including invertebrates or vertebrates or any other 
marine organism.. Preferably, the sample includes single celled organ sms, such 
as bacteria, More preferably, the sample includes samples that are expected to 
contain an ectemascidin. Such samples include, for example, Ecteinastidia 

30 species, including Ecieinascidia turbinate . 
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When used as probes, nucleic acid molecules of the present invention 
can be detectably labeled and contacted wife a sample. Nucleic acid molecules 
feat bind with the nucleic acid of fee present invention can be identified, cloned 
and sequenced using methods known in fee ait. The identified nucleic acid 
5 molecules can be operably linked to expression control se juences such feat a 
polypeptide encoded by the identified nucleic- acid molecule can be made and 
characterized. 

When used as PGR primers, the nucleic acid molecules of the present 
invention can be used to amplify nucleic molecules in a sample. One or more 

1 0 than one of the primers used in one or more amplification reactions can be a 
nucleic acid molecule of the present invention. The amplified nucleic acid 
molecules are presumptively derived from a gene encoding an ecteinascidin 
synthesis activity. The amplified nucleic acid molecules can be identified, 
cloned and sequenced using methods known in the art. The amplified nucleic 

15 acid molecules can be operably linked to expression control sequences such that 
a polypeptide encoded by fee amplified nucleic acid molecule can be made and 
characterized. 

The nucleic acid molecules of fee present invention can also be used to 
identify nucleic acid molecules that are upstream or downstream from a targeted 

20 segment of a gene encoding an ecteinascidin synthesis activity, A nucleic acid 
molecule that encodes a conserved region of a gene encoding an ecteinascidin 
synthesis activity, A nucleic acid molecule of the present invention can be used 
in primer extension or inverse PGR methods such that upstream or downstream 
segments from the point of hybridization are identified. These extended 

25 segments can be identified, cloned and sequenced using methods known in the 
art The extended segments can be operably linked to expression control 
sequences such feat a polypeptide encoded by fee extended segment can be made 
and characterized. 

Whether fee nucleic acid molecule of the present invention is used as a 

3 0 probe, primer or PGR primer, fee methods of the present invention identify 
nucleic acid molecules that presumptively encode at least a porti on of a gene 
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encoding an ecteinascidin. synthesis aotivity. Any of these processes can be used 
alone, in combination, or restoratively to Identify at least portions of genes 
encoding an ecteinascidin synthesis activity in a sample. 

The present invention also includes nucleic acid molecules identified by 
5 the present invention- The identified nucleic acid molecules can include 
expression control sequences operably linked to the identified nucleic acid 
molecules. Such constructs can. be used to make polypeptides encoded by the 
identified nucleic acid molecules and the polypeptides can be characterized as to 
a variety of structures and functions, particularly structures and functions 
10 associated with genes encoding an ecteinascidin synthesis activity. The present 
invention includes a library of nucleic acids, cells or polypeptides identified by 
the present invention. 

VIL Meth od of Iden tifying a Bioactive Compound, Bioactive Compounds, 
15 and Therapeutic Compositions 

The present invention also includes a method for identifying a bioactive 
compound, such as those made or identified using a bacteria, nucleic acid 
molecule or polypeptide of the present invention, including contacting a 
compound made or identified by the present invention with at least one in vitro, 

20 ex vivo or in vivo assay system and determining the bioactivity of said 

compound. The present invention includes bioactive compounds identified 
using this method. The identified bioactive compounds can be provided in a 
pharmaceutical!}' acceptable carrier and can be a pharmaceutical compound. 
In vitro, ex vivo and in vivo systems used in the present invention are 

25 preferably those known in the art for a bioactivity to be identified. The assay 
chosen to be used in this method is related to a bioactivity that is being screened 
for. Preferred systems include those that determine at least one ecteinascidin 
activity. For example, in vitro systems (systems that do not use whole organisms 
or whole cells) and ex vivo systems (systems that use whole cells or portions of 

30 cells) for the identification of ecteinascidin activity are known in. the art, 

including those disclosed in U.S. patents 5,256,663, 5,478,932,5,484,717, and 
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5,089,273, In vivo systems (systems that use whole organisms or tissues or 
organs derived therefrom) are also known in the art (see, for example, Valott et 
ah, 1 998). Compounds are identified using these methods as having a desired 
bioactivity. Compounds identified by these methods are bioactive compounds 
5 that have at least one bioactivity. 

Sa^ening QS'compQumhJbracmMm. 

The following assays can be performed to confirm the bioactivity of a 
compound: 

10 a) antimicrobial effect on S. aureus by placing a compound on a paper 

disk and determining the ability of the compound to inhibit the growth of the S. 
aureus (Benson, 1994), The results of tins assay establish the toxicity of the 
compound towards Gram-positive bacteria. 

b) antimicrobial effect on E. coli by placing a compound on a paper 

1 5 disk and determining the ability of the compound to inhibit the growth of the E. 
coli (Benson, 1994). The results of this assay establish the toxicity of a 
compound towards Gram-negative bacteria. 

c) antimicrobial effect on Candida albicans by placing a. compound on 
a paper disk and determining the ability of the extract to inhibit the growth of 

20 Candida albicans (Benson, 1994). The results of this study establish the toxicity 
of compounds towards yeasts and fungi. 

d) inhibition of the growth of cancer cells by contacting a compound 
with the National Cancer Institute's (NCI) cell line screen (approximately sixty 
cell lines) against up to fifty-one cancer cell types in vitro (Boyd et ai, 1995). 

25 These results provide an activity profile for the compound. The activity profiles 
of an extract can be compared to the activity profiles of other samples in the NCI 
database of activity profiles. Similar activity profiles of different extracts, 
including known extracts with known modes of action, strongly suggests that the 
samples have similar modes of action. A novel activity profile strongly suggests 

3 0 that the compoun d has a novel mechanism of action. 
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e) cytotoxic activity of a compound can be determined by a variety of 
methods, including inhibition of brine shrimp by contacting twenty-four hour old 
brine shrimp nauplii for twenty-four hours with an compound and observing the 
inhibition of the activity or viability of the brine shrimp. The results of this 
5 assay estabii sh the cytotoxicity of the compound towards whole organisms. 

Compounds identified as having a bioaetivity have presumptive 
therapeutic activity. Such therapeutic activity and related pharmacological 
parameters can be confirmed using the methods discussed herein. 

10 Pharmacology and toxicity of bioactive compounds and bio acti yiiies 

The structure of a bioactive compound or bioaetivity can be determined 
or eordkmed by methods known in the art. such as mass spectroscopy. For 
bioactive compounds and bioactivitics stored for extended periods of time under 
a variety of conditions., the structure, activity and potency thereof can be 

15 confirmed. 

Identified bioactive compounds and bioactivities can be evaluated for a 
particular activity using recognized methods and those disclosed herein. For 
example,, if an identified bioactive compound or bioaetivity is found to have 
antic-ant a c ell activity in vitro, then the bioactive compound or bioaetivity 

20 would have presumptive pharmacological properties as a chemotherapeutic to 
treat cancer. Such nexuses axe known in. the art for several disease states, and 
more are expected to be discovered over time. Based on such nexuses., 
appropriate confirmatory in vitro and in vivo models of pharmacological activity 
and toxicology can be selected and appropriate tests performed. The methods 

25 described herein can also be used to assess pharmacological selectivity and 
specificity, and toxicity. 

Identified bioactive compounds and bioactivities can be evaluated for 
toxicologieal effects using known methods (see, Lu, 1985; U.S. Patent Nos; 
5,196,313 to Culbreth (issued March 23, 1993) and 5,567,952 to Benet (issued 

30 October 22, 1996)), For example, toxicology of a bioactive compound or 

bioaetivity can be established by determining in vitro toxicity towards a cell line, 
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such as a mammalian, for example human, cell line. Bioactive compounds sad 
bioaotivities can be treated with, for example, tissue extracts, such as 
preparations of liver, such as microsomal preparations, to determine increased or 
decreased toxicologies! properties of the bioactive compound of bioaetivity after 
5 being metabolized by a whole organism. .The resul ts of these types of studies are 
predictive of toxicological properties of chemicals in animals, such as mammals, 
including humans. 

Alternatively, or in addition to these in vitro studies, the toxicological 
properties of a bioactive compound or bioaetivity in an animal model, such as 

10 mice, rats, rabbits, dogs or monkey, can be determined using established 

methods (see, Lu, 1985; and Creasey, 1979). Depending on the toxicity, target 
organ, tissue, locus and presumptive mechanism of the bioactive compound or 
bioaetivity, tire skilled artisan would not be burdened to determine appropriate 
doses, LD S0 values, routes of administration and regimes that would be 

1 5 appropriate to determine the toxicological properties of the bioactive compound 
or bioaetivity. hi addition to animal models, human clinical trials can be 
performed following established procedures, such as those set forth by the 
United States Food and Drag Administration (USFDA) or equivalents of other 
governments. These toxicity studies provide the basis for determining the 

20 efficacy of a bioactive compound or bioaetivity in vivo. 
.Efficacy of bioactive compounds and Moactivities 

Efficacy of a bioactive compound or bioaetivity can be established using 
several art recognized methods, such as in vitro methods, animal models or 
human clinical trials (see, Creasey, 1979). Recognized in vitro models exist for 

25 several diseases or conditions. For example, tire abili ty of a compound or 

composition to extend the life-span of HIV-infected cells in vitro is recognized 
as an acceptable model to identify chemicals expected to be efficacious to treat 
HFV infection or AIDS (see, Daluge et. at, 1995). Furthermore, the ability of 
cyclosporin A (CsA) to prevent proliferation of T-cells in vitro has been 

30 established as an acceptable model to identify chemicals expected to be 

efficacious as immunosuppressants (see, SuthantMran et al., 1996), For nearly 
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every class of therapeutic, disease or condition, an acceptable in vitro or animal 
model is available. In addition these in vitro methods can use tissue extracts, 
such as preparations of liver, such as microsomal preparations, to provide a 
reliable indication of the effects of metabolism on a bioactive compound or 
5 bioactivity. Similarly, acceptable animal models can be used to establish 
efficacy of bioactive compounds and bioactivities to treat various diseases or 
conditions. For example, the rabbit knee is an accepted model for testing agents 
for efficacy in treating arthritis {see, Shaw and Lacy, 1973). Hydrocortisone, 
which is approved for use in humans to treat arthritis, is efficacious in Ms mode! 

10 which confirms the validity of this model (see, McDonough, 1982), When 
choosing an appropriate model to determine efficacy of bioactive. compounds 
and bioactivities, the skilled artisan can be guided by the state of the art to 
choose an appropriate model, doses and route of administration, regime and 
endpoint and as such would not be unduly burdened. 

1 5 In addition to animal models, human clinical trials can be used to 

determine the efficacy of bioactive compounds and bioactivities. The USFDA, 
or equivalent governmental agencies, have established procedures for such 
studies. 

Selectivity of bioactive compounds and bioactivities 

20 'The in vitro and in vivo methods described above also establish the 

selectivity of a candidate modulator. It is recognized that chemicals can 
modulate a wide variety of biological processes or can be selective. Panels of 
cells as they are known in the art can be used to determine die specificity of the a 
bioactive compound or bioactivity (WO 98/13353 to Whitney et al. s published 

25 April 2, 1998). Select; \ f< is evi lent, for example, in the field of chemotherapy, 
where the selectivity of a chemical to be toxic towards cancerous cells, but not 
towards non-cancerous cells, is obviously desirable. Selective modulators are 
preferable because they have fewer side effects in the clinical setting. The 
selectivity of a bioactive compound or bioactivity can be established in vitro by 

30 testing the toxicity and effect of a bioactive compound or bioactivity can be 
established in vitro by testing the toxicity and effect of a bioactive compound or 
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bioactivify on a plurality of cell lines that exhibit a variety of cellular pathways 
and sensitivities. The data obtained ibrai these in vitro toxicity studies can be 
extended to animal model studies, inckdiag human clinical trials, to determine 
toxicity, efficacy and selectivity of a bioactive compound or bioactivity. 
5 The selectivity, specificity and toxicology, as well as the general 

pharmacology, of a bioactive compound or bioactivify can he often improved by 
generating additional test chemicals based on the stxucinre/property relationship 
of a bioactive compound or bioactivity originally identified as having activity 
Bioactive compounds and hioactivities can be modified to improve various 

10 properties, such as affinity, life-time in blood, toxicology, specificity and 
membrane permeability. Such refined bioactive compounds and hioactivities 
can be subjected to additional assays as they are known in the art or described 
herein. Methods for generating and analyzing such compounds or compositions 
are known m the art, such as U.S. Patent No. 5,574,656 to Agrafiotis et al. 

J 5 Fh a rm aceu t icq I composition s 

The present invention also encompasses a bioactive compound or 
bioactivity in a pharmaceutical com] sit compri ' .i pharmaceutical!}' 
acceptable carrier prepared for storage and preferably subsequent administration, 
which have a pharmaceuticafiy effective amount of the bioactive compound or 

20 bioacfivity in a pharroaceutically acceptable carrier or diluent. Acceptable 

carriers or diluents for therapeutic use are well known in the pharmaceutical art, 
and are described, for example, in Remington's Pharmaceutical Sciences, Mack 
Publishing Co., (A.R. Gennaro edit. (1985)). Preservatives, stabilizers, dyes and 
even flavoring agents can be provided in the pharmaceutical composition. For 

25 example, sodium henzoaie, sorbic acid and esters of p-hydroxybenzoie acid can 
be added as preservatives, hi addition, antioxidants and suspending agents can 
be used. 

The bioactive compounds and hioactivities of the present invention can 
be formulated and used as tablets, capsules or elixirs for oral administration; 
30 suppositories for rectal admufistration; sterile solutions, suspensions or 
injectable administration; and the like. Mjeetables can be prepared in 
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conventional forms either as liquid solutions or suspensions, solid forms suitable 
for solution or suspension in liquid prior to injection, or as emulsions. Suitable 
excipients are, for example, water, saline, dextrose, marmttol, lactose, lecithin, 
albumin, sodium gmiamate, cysteine hydrochloride and the like. In addition, if 
5 desired, the injectable pharmaceutical compositions can contain minor amounts 
of nontoxic auxiliary substances, such as wetting agent s, pH buffering agents and 
the like. If desired, absorption enhancing preparation, such as liposomes, can be 
used. 

The pharmaceutical effective amount of a bioactive compound or 

1 0 bioaeiiv-i ty required as a dose will depend on the route of administration, the type 
of animal or patient being treated, and the physical characteristics of the specific 
animal under consideration The dose can be tailored to achieve a desired effect, 
but will depend on such factors as weight, diet, concurrent medication and other 
factors which those skilled in the medical arts will recognize, to practicing the 

1 5 methods of the present invention, the pharmaceutical compositions can be used 
alone ox in combination with one another, or in combination with other 
therapeutic or diagnostic agents. These products can be utilized in vivo, 
preferably in a mammalian patient, preferably in a human, or in vitro, to 
employing them in vivo, the pharmaceutical compositions can be administered to 

20 the patient in a variety of wax's, including parenteraUy, intravenously, 
subcutaneously, intramuscularly, colonically, rectally, nasally or 
intraperitoneally, employing a variety of dosage forms, Such methods can also 
be used in testing the activity of bioactive compounds or hioacti vities in vivo. 
As will be readily apparent to one skilled hi the art, the useful in vivo 

25 dosage to be administered and the particular mode of administration will vary 
depending upon the age, weight and type of patient being treated, the particular 
pharmaceutical composition employed, and the specific use for which the 
pharmaceutical composition is employed. The determination of effective dosage 
levels, that is the dose levels necessary to achieve the desired result, can be 

30 accomplished by one skilled in the art using routine methods as discussed above. 
Typically, human clinical applications of products are commenced at lower 
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dosage levels, with dosage level being increased until the desired effect is 
achieved. Alternatively^ acceptable in vitro studies can be used, to establish 
useful doses and routes of administration of the bioaetive compounds and 
bioacriviiies. 

5 hi non-human animal smdies, applications of the pharmaceutical 

compositions are commenced at higher dose levels, with fee dosage being 
decreased until the desired effect is no longer achieved or adverse side effects are 
reduced or disappear. The dosage for the bioaetive compounds and bioactivities 
of the present invention can range broadly depending upon die desired affects, 

10 the therapeutic indication, rou te of administration and purity and activity of the 
bioaetive compound or bioactivity. Typically, dosages can be between about 1 
ng/kg and about 10 mg/kg, preferably between about 10 ng/kg and about 1 
mg/kg. more preferably between about 100 ng/kg and about 100 micrograms/kg, 
and most preferably between about 1 nucrogram/kg and about 10 

15 microgranas/kg. 

The exact formulation, route of adnnnistration and dosage can be chosen 
by the individual physician in view of the patient's condition (see, Fingle et ah, 
in The Pharmacological Basis of Therapeutics (1975)). It should be noted that 
the attending physician would know how to and when to terminate, interrupt or 

20 adjust administration due to toxicity, organ dysfunction or other adverse effects, 
Conversely, the attending physician would also know to adjust treatment to 
higher levels if the clinical response were not adequate. The magnitude of an 
administrated does in the management of the disorder of interest will vary with 
the severity of the condition to be treated and to fee route of adimnistation, The 

25 severity of the condition may, for example, be evaluated, in part, by standard 
prognostic evaluation methods. Further, the dose and perhaps dose frequency, 
will also vary according to the age, body weight and response of the individual 
patient, including those for veterinary applications. 

Depending on the specific conditions being treated, such pharmaceutical 

30 compositions can be formulated and administered systemically or locally. 
Techniques for formation and administration can be found in Remington's 
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Pharmaceutical Sciences, ISthBd., Mack Publishing Co., Easton, P A (1990). 
Suitable routes of administration can include oral, nasal, rectal, transdermal, otic, 
ocular, vaginal, feaasmncosai or intestinal administration; parenteral delivery, 
including intramuscular, subcutaneous, intramedullary injections, as well as 
5 intrathecal, direct intraventricular, intravenous, intraperitoneal, intranasal, or 
intraocular injections. 

For injection, the pharmaceutical compositions of the present invention 
can he formulated in aqueous solutions, preferably in physiologically compatible 
buffers such as Hanks' solution, Ringer's solution or physiological saline buffer. 

1 0 For such hansmueosal administration, penetrans appropriate to the barrier to be 
permeated are used in the formulation. Such penetrans are generally known in 
the art. Use of pharmaceutically acceptable carriers to formulate the 
pharmaceutical compositions herein disclosed for the practice of the invention 
into dosages satiable for systemic administration is within the scope of the 

1 5 invention. With proper choice of carrier and suitable manufacturing practice, the 
compositions of the present invention, in particular, those formulation as 
solutions, can be administered parenterally, such as by intravenous injection. 
The pharmaceutical compositions can be formulated readily using 
pharmaceutically acceptable carriers well known in the art into dosages suitable 

20 for oral administrations. Such carriers enable the bioactive compounds and 
bioactivities of the invention to be formulated as tables, pills, capsules, liquids, 
gels, syrups, slurries, suspensions and the like, for oral ingestion by a patient to 
be treated. 

Agents intended to be administered mtraceilularly maybe administered 
25 using techniques well known to those of ordinary skill in the art For example, 
such agents may be encapsulated into liposomes, men administered as described 
above. Substantially all molecules present in an aqueous solution at the time of 
liposome formation are incorporated into or within the liposomes thus formed. 
The liposomal contents are both protected from the external micro-envhomnent 
30 and, because liposomes fuse will cell membranes, are efficiently delivered into 
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the cell cytoplasm. Additionally, das to their hydrophobicity, small organic 
molecules can be directly administered hitracelhilarly. 

Pharmaceutical compositions suitable for use hi the present invention 
include compositions wherein the active ingredients are contained in an effective 
5 amount to achieve its intended purpose. Determination of the effecti ve amount 
of a pharmaceutical composition is well within the capability of those skilled in 
the art, especially hi light of the detailed disclosure provided herein. In addition 
to the active ingredients, these pharmaceutical compositions can contain suitable 
pharraaceuticaiiy acceptable carriers comprising excipients and auxiliaries which 

10 facilitate processing of the active chemicals into preparations which can be used 
pharraaceuticaiiy. The preparations formulated for oral administration may be in 
the form of tables, dragees, capsules or solutions. The pharmaceutical 
compositions of the present invention can be manufactured in a manner that is 
itsel f known, for example by means of conventional mixing, dissolving, 

15 granulating, dragee-making, emulsifying, encapsulating, entrapping or 
lyophilizhig processes. Pharmaceutical formulations for parenteral 
administration include aqueotxs solutions of active chemicals in water-soluble 
form, 

Additionally, suspensions of the active chemicals may be prepared as 
20 appropriate oily injection suspensions. Suitable lipophilic solvents or vehicles 
include fatty oils such as sesame oil, or synthetic fatty acid esters, such as ethyl 
oleate or triglycerides or liposomes. Aqueous injection suspensions may contain 
substances what increase the viscosity of the suspension, such as sodium 
carboxymethyi cellulose, sorbitol or dextran. Optionally, the suspension can also 
25 contain suitable stabilizers or agents that increase the solubility of the chemicals 
to allow for the preparation of highly concentrated solutions. 

Pharmaceutical compositions for oral use can be obtained by combining 
fee active chemicals wife solid excipient, optionally grinding a resulting mixture, 
and processing the mixture of granules, after adding suitable auxiliaries, if 
30 desired, to obtain tables or dragee cores. Suitable excipients are, hi particular, 
fillers such as sugars, including lactose, sucrose, raamhtol or sorbitol; cellulose 
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preparations such as, for example, maize starch, wheat starch, rice starch, potato 
starch, gelatin, gum tragacanth, methyl cellulose, hydroxypropylmethyl- 
cettulose, sodium c arboxvmeihylcellulose and/or polyvinylpyrrolidone. If 
desired, disintegrating agents can he added, such as fee cross-linked polyvinyl 
5 pyrolidons, agar, algisic acid or a salt thereof such as sodium alginate. Dragee 
cores can be provided with suitable coatings. Dyes or pigments can be added to 
fee tablets or dragee coatings for identification or to characterize different 
combinations of active doses. 

The bioaciive compounds and bioactivities of the present invention, and 

1 0 pharmaceutical compositions that include such bioacti ve compounds and 

bioactivities are useful for treating a variety of ailments in a patient, including a 
human. As set forth in the Examples, the bioactive compounds and bioacti vities 
of the present invention have antibacterial, antimicrobial, antiviral, anticancer 
cell, antitumor and cytotoxic activity. A patient in need of such treatment can be 

1 5 provided a bioaciive compound or bioaetivity of the present invention, preferably 
in a pharmacological composition in an effective amount to reduce the number 
or growth rate of bacteria, microbes, cancer cells or tumor cells in said patent, or 
to reduce the mfectivity of viruses in said patient. The amount, dosage, route of 
administration, regime and endpoint cars all be determined using the procedures 
20 described herein. 

The invention will now be described by the following non-limiting 
examples. 

Example 1 

2 5 identification of Bacteria Associated With 

BeteinaseMia tttrbinata by in situ Hybridization 

Larvae of £ turbinata for in situ hybridization are fixed in 4% 
paraformaldehyde-Q. 1 M MOPS (pH 7.4>0.5 M Nad Fixed larvae are either 
used immediately or stored in 70% ethanol at -20°C. The fixed larvae are treated 
30 with proteinase K (Boehringer Mannhehn), refoed in 4% paraformaldehyde, 
treated with acetic anhydride in 0.1 M tiiethanoiamine, washed with buffer, and 
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incubated m hybridization buffer (5X SSC, 0.1% Tween 20, 5 mM BDTA, and 
IX Denhardi's solution) at 55°C for one hour, Buorescem-labeled probes are 
added in fresh hybridization buffer at a concentration of 2 ng per microliter, The 
samples are heated to 55°C for five to ten minutes mid then incubated at 43 °C for 

5 4 to 6 hours. Each wash is conducted twice for 1 5 minutes as follows; high 
stringency wash 1 is 10% forniaraide, 5X SSC, and 1% sodium dodecyi sulfate, 
used at 47°G 5 wash solution 2 is 10% fbrmamide, 2X SSC, and 1% sodium 
dodecyi sulfate, used at 37°C» and wash solution 3 is 10% formainide, 2X SSC, 
and 0.1% Tween 20, used at room temperature. The larvae are suspended in 

10 Veetasheild (Vector) mounting media and viewed under epifiuoresccnce. The 
probes used for hybridization are designed to hybridize to regions of die SSU 
rRNA gene (Haygood and Davidson, 199?) and include the universal eubacterial 
probe BUB 338 (SEQ ED NO:4) (Amann et al., 1997), the E. palauensis SSU 
rRNA probe 1247r (5-AAGTAGCGGCCCTTTGTC-3') (SEQ 2D NOT), and 

15 the & sertuia probe Bnl253r (SEQ ID NO:9) (Haygood and Davidson, 1 997). 

Example 2 

Extraction of Nucleic Acid Molecules from Samples of 
E. turbinate that Include Bacteria 

20 Unless otherwise noted, molecular biology procedures used are standard 

techniques used in the field (Sambrook et al, 1989; Ausubel et ah, I99S). 
DNA extractio n 

Samples of either adult or larval & turbinate! were obtained from the 
waters of the Flori da Keys. Free-swimming larvae were collected in the lab from 

25 E. turbinata adult colonies. Larval DNA was extracted as previously described 
"for B. neritina (Haygood and Davidson, 1997). Larvae were concentrated in L5 
milliliter microc< i itrifugc tubes' (app > irately 25 mg of larvae per tube) by 
gentle centrifugation and then rinsed four times in filtered seawater (0.2 
micrometer pore size filter) to nuniraize the contaminating seawater bacteria. 

30 Excess water was removed and the pellets were frozen at - S0 c C for later use. 
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DNA was extracted from the B. turbinata larvae using a QIAamp Tissue Kit as 
directed by fee manufacturer (Qiagen Ine,, Valencia, CA). 

Example 3 

5 IdenMcatioa of Geaes in Samples of E. turl/waia 

that Include Bacteria 

Primer desim 

Degenerate PCR primers are designed based on adenylation domains of 
non-ribosomal peptide synthetase (NRPS) genes. PCR primers are designed as 

1 0 follows: 1) using published degenerate sequences for the A-doniains, and 2) 

using published safraniycin MX1 sequences as a basis for more specific primers, 
The adenylation domain is selected as the first target because, it is the most 
highly conserved region of the NRPS genes, in addition, it is the loading domain 
for amino acids and thus serves as the basis for amino acid selectivity in the 

1 5 NRPS genes. Recently, it has been reported that specific amino acids within this 
domain can be decoded to predict which amino acid is encoded by the domain. 
Therefore, comparison of isolated A-domam sequences with those m published 
databases will provide added evidence that the genes are encoding enzymes in 
the ectemascidin pathway. 

20 

tCJLcon^: i p ration oj DM jromE. turbinata 

The PCR conditions for the initial amplification of the genes from E. 
turbinata DNA can be as follows. A total reaction volume of 50 microliters 
contains approximately 100 ng of E. turbinata DNA (either adult or larval), 1 

25 micromolar of each of two degenerate primers, and Taq polymerase and buffer 
(Boehringer Mannheim Corp., Mdianapolis, IN). A PCR protocol is optimized 
for the degenerate NRPS primers, The cycle conditions start with a "touch down" 
sequence, which lowers the annealing temperature from 60 to 40°C at a rate of 
2°C per cycle (1 1 cycles), and were then maintains at 40°C for a total of 51 

30 eyries. Cycle steps are as follows: denaturation (94*C; 1 minute), annealing 
(60°C to 40°C; 2 minutes), and extension (72°C; 1 minute). 
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DNA cloning 

PCR reactions are electrophoreses on 0.8% agarose gel and visualized 
with ethidium bromide, PCR products of the expected length are closed using a 
TOPO TA Cloning kit into the Invitrogen pCR®2.1-TOPO vector as described 
5 by the manufacturer (Invitrogen Corp., Carlsbad, CA). Recombinant clones 
containing insert DNA are then sequenced using standard protocols. Since 
NRPS enzymes can be modular clones from the degenerate PCR primers can 
represent a pool of fragments from different NRPS domains. 

Plasmid DNA for sequencing is prepared using the Qiagen QIAprep Spin 
Mhiiprep Kit (Qiagen, Inc., Valencia, CA). All sequencing is performed with an 
ABI automated sequencer (model 373A) by using a PRISM Ready Reaction 
DyeDeoxy terminator cycle sequencing kit as recommended by the manufacturer 
1 5 (Perkin-Elmer). Cloned genes are sequenced using primers directed against the 
cloning vector, pCR®2. 1-TOPO (Invitrogen, Carlsbad, CA). 

Bsamplej 

Sequences of Ecteinascidin Synthesis Activity from E. turbmata that Include 
20 Bacteria 

Cloned DNA sequences may be identified by using the BLAST (basic 
local alignment search tool) server of the National Center for Biotechnology 
Information accessed over the Internet (Altschul et at, 1997). Clones that are 
identified as having homology to known NRPS genes can be used to probe 
25 libraries, such as cosraid libraries, made from genomic DNA isolated from E. 
turbinate or bacteria of the present invention. Alternatively, inverse and long- 
range PCR methods maybe used to isolate genes and gene clusters encoding 
NRPS genes and gene clusters. 
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Example 5 
Combinatorial Biosynthesis 

Combin I » a* : >k synthesis has generally been used in the search for 
novel molecules with applications as pharmaceuticals or as platforms for 
5 combinatorial synthesis. For example, Shan et at have demonstrated that 
engineered aromatic or modular PKSs can be used to generate polyketide 
libraries of different molecular sizes and shapes (Shen et at, 1999). The 
biosynthetic genes of the present invention, for example genes for ecteinascidin 
synthesis, could he incorporated into these systems to create derivatives/analogs 

10 of ecteiuascidins with improved properties such as reduced toxicity/myalgia, 
greater efficacy etc. (Shen et at, 1999; Xue et ah, 1998). Recently, the 
erythromycin PKS genes have been engineered to effect combinatorial 
alterations of catalytic activities in the biosynthetic pathway (McDaniel et ah, 
1999). This has resulted in the successful generation of more than fifty 

15 inaerolides winch would otherwise be impractical to produce through chemical 
methods. This leads to the creation of libraries of novel "unnatural" natural 
products exhibiting altered functions (McDaniel et at, 1999). The ecteinascidin 
NRPS genes could be used in such a system, to create analogs of ecteinascidin 
with improved properties. 

20 The cloned biosynthetic genes presented here have applications in 

bioprospecting. The cloned PKS genes could be used in PGR, in situ 
hybridizations, etc to isolate novel marine (and terrestrial) NRPS and 
ectemasoidms which may exhibit novel structures and novel activities 
(antibacterial, antifungal, anticancer, etc.) 

25 
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Example 6 

The Localization of Bioaciive Secondary Metabolites in Sponges, 
Ascidians and their Associated Symbion is 

Methods 

5 Adult E. turbinate were shipped by CaSioMaiine Technologies Inc., 

Carlsbad, CA from wild colonies in Florida and maintained in sialic aquaria at 
28 C C for up to 2 weeks. Larvae were either released from the colonies in the 
morning and individually collected by pipette or excised from adult individuals 
wife a scalpel. Some larvae were collected directly in Sweetings Cay. Bahamas 
10 by dissecting gravid adults within an hour of collection. Larvae were collected 
from the adults, either by tank raising colonies or excising gravid adults. Larvae 
collected by dissection of individuals possessed a range of developmental stages 
(Figure 5). Larvae ware washed several times in sterile seawater prior to all 
experiments. 

15 Larval chemistry. Three hundred larvae (~30O mg wet weight) were 

extracted in methanol which was dried in vacuo to afford 6.9 mg of extract. This 
crude extract was dissolved in deuterated methanol (CD 3 OD) and analyzed by 'H 
NMR. spectroscopy. The sample was then dried and dissolved in DMSO at a 
concentration of 10 mg/mL and tested in a Human Colon Tumor (HOT) 

20 bioassay. A portion of the extract was dissolved in methanol (100 fig/mL) and 
analyzed using ESI mass spectrometry. 

PCR of E. turbinata bacterial 16S rRNA genes. DNA was extracted 
from either fresh larvae or larvae preserved in ethanol at 4°C using aDNeasykit 
(Qiagen). Portions of the 1 6S rRHA gene were amplified using "universal'* 

25 eubacterial primer pairs: 27f-1492r and 1055f-1392r (see Table 1 for all 
oligonucleotide sequences used) (Amann et ah, 1995). PCR reactions were 
performed with 150-200 ng larval genomic DNA, Tag polymerase, IX Mg 
buffer, 1 0 fig BSA, and 250 uM dNTP using a Hybaid thermocycler. A 30 cycle- 
program was used with the following steps: (1) 95 °C for 30 seconds for regular 

30 primers or 1 minute for GC clamp primers; (2) 50° for 1 minute; (3) 72° for 1 
minute; and (4) a final elongations step at 72° for 7 minutes. Prior to 
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sequencing, all PGR products were cleaned using a QIAQuiofc™ cleanup kit. 
Sequencing was performed using ABI PRISM™ Dye Terminator Cycle 
Sequencing. Sequences were analyzed and aligned using Seqoencher™ and by 
eye, 

5 

Table 1 



Oligonucle otide Sequences (5 f -> 3') Application"* References 

Symbiont specific 



ET1253r 


GGG UUG CGA AGC AGC 


F 


Herein 




GAUG(SEQIDNO:2) 






Domain Bacteria 






Eub2?f 


TGA GCC AGG ATC AAA 


P 


Amann et ai., 




CTCT(SEQXDNO:3) 




1995 


Eub338 


GCT GCC TCC CGT AGG 


F 


Fuchs et al, 




AGT (SEQ ID NO;4) 




1998 


Eubl055f 


ATG GCT GTC GTC AGC 


P 


Amann et al, 




T (SEQ ID NO:5) 




1995 


Eti.bl392r 


CGCCCGCCGCGCCCCGC 


P 


Amann et al., 




GCCCGGCCCGCCGCCCC 




1995 




CGCCCCACGGGCGGTGT 








GTAC(SEQ3DNO:6) 






Eubl492r 


AAG TCG TAA CAA GGA 


p 


Amann et al., 




AGC CGT A (SEQ ID 




1995 


Controls 


NO:8) 






Bnl253r 


CAT CGC TGC TTC GCA 


F,P 


Haygood and 




ACCC(SEQIDNO:9) 




Davidson, 








1997 


Bn240r 


TGC TAT IT G ATG AGC 


P 


Haygood and 
Davidson, 




CCG CGT T (SEQ ID 






NO:I0) 




1997 


Bncontrol 


ACG TCA CCG TCC AGC 


F 


Haygood. and 




CTCT(SEQIDNO:ll) 




Davidson, 








1997 


End 855 








Epl247r 


AAG TAG CGG CCC TTT 


P 


Schmidt et 




GTC (SEQ ID NO: 12) 




al. s 2000 



' P - PGR primer, F - FISH probe 
h Oligonucleotide sequences used for FISH experiments were 5' biothrylated 
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Denaturing gradient gel electrophoresis (DGGE) of bacterial 16S 
rRNA from E. turbinate. DGGE is a technique that has been used to separate 
and identify bacterial sequences from complicated mixtures of microbial 
populations based on the small sub-unit rRNA genes (SSU rRNA) (Sakai et ai, 
5 1 996). Ribosomal RNA is an Meal molecule to use for phylogenetk studies 
because it is relatively conserved throughout all organisms and contains 
interspersed variable regions. "Universal" primers are used to amplify the 
variable regions of the SSU rRNA gene from a mixed population of bacteria. 
These amplified gene fragments of the same si2e are then separated 
1 0 electrophoretically according to sequence differences (primarily GC content) on. 
a denaturing gel. The resulting bauds can be excised from the gel ; reamplified 
and sequenced. 

DGGE experiments were performed on SSU rRNA gene fragments 
(1055-1392 corresponding to E. coti nwnbering) amplified from B. inrbinata 

1 5 genomic DNA to study the diversity of associated microbes, PCR products 
containing a GC clamp were cleaned with a QIAQuick kit. The 8% acrylamide 
gel gradients were made with 20-70%, 30-70% and 30-60% denaturant (urea) 
and run in 1 X TAB buffer at 60°C for either 5 or 14 hours (200V or 80V, 
respectively) using a BioRad DCode DGGE system. Bands were stained for 20 

20 minutes with Sybre-gold™ fluorescent stain, documented using a Nucleotech gel 
documentation system with Nucleovision software and cut out using sterilized 
razor blades. Each hand was incubated in an equivalent volume of 0.5 X TE 
buffer overnight at 37°C and 5 pL of this solution was used as a template for 
PCR amplification with 1 055 f and 1392r primers. PCR products were cleaned, 

25 ethanol precipitated arid submitted for sequencing. 

Microscopy, For TEM, larvae were fixed in Karnovsky's fixative at 4°C. 
Samples were further fixed in 1% osmium tetroxide in cacodylate buffer for 1 
hour, stained with 4% ethanolie uranyl acetate, dehydrated in an 
ethanol/propylene oxide series, and embedded in Epon resin. Thick sections (1 

30 pm) were obtained on a Reichert Ultramicrotome and stained with toluidene 
blue. Thin sections (silver-gold by interference) were stained with bismuth 
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(QMS ill saturated bismuth per pi water) and 4% ethanolic uranyl acetate and 
examined on a Zeiss TEM. light and confocal microscopic images were 
obtained in the region near the adhesive papillae for orientation, 

IiniaaEoiocaiizationexperiineats, For immunogold experiments, 
5 larvae were feed, k paraformaldehyde aid then hybridized with the fluorescent 
BUB 338 probe (see FISH section), dehydrated in aa ethanol series, embedded in 
LR white resin, cured overnight at 50°C, and sectioned to gold thickness by 
interference. Sections were picked up on parlodion coated nickel grids and 
incubated on drops of blocking buffer (TBS, pH 7.6, 0.8% BSA, 5% normal goat 

1 0 scram, 0. 1 % sodium azide) for 30 minutes at room temperature. Grids were' 
transferred to goat anti-fluorescent antibody conjugated to 10 ran colloidal gold 
diluted 1 :25 with gold buffer (TBS, 1% normal goat serum, 0. 1% gelatin, 0,2% 
sodium azide) for 2 hours at room temperature. Grids were washed in buffer 
(blocking buffer without goat serum) 2 * 15 minutes, TBS 2 * 10 minutes, and 

15 distilled water 1 x 5 minutes. Tissues containing immunohnked gold were fixed 
by incubating sections in 2% glutaraldehyde in 0.1 M Na Cacodylaie buffer, pH 
7.4 for 10 minutes. Grids were then cotmterstained with. 2% aqueous uranyl 
acetate for 10 minutes followed by bismuth subnitrate ibr 10 minutes, and 
examined by TEM. 

20 Fluorescence in situ hybridization (FISH). The probes used for the 

hybridization experiments were the universal eubaeterial probe ETJB338 or 
"symbiont specific" probe Etl253r. The symbiont specific probe was designed 
based on the sequence obtained from die two DGGE bands (see above). The 
most variable regions of the 16S gene between 1055 and 1392 were analyzed for 

25 site accessibility and similar sequences using CHECK PROBE (EBP) (Amann ei 
al, 1995; Fuohs et ai., 1998). 

For control experiments, the following probes were used: the bryozoan 
symbiont specific probe {Endobugula sertula) Bnl253r (Haygood and Davidson, 
1997); a scrambled probe Bncontrol (Haygood and Davidson, 1997); the sponge 

30 symbiont specific probe Entoiheonelh palauensis 1249r (Schmidt et ah, 2000);. 
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and no probe controls. The sequences of all probes used are shown in Table 1 . 
Each probe was 5 ' labeled with either fluorescein or CY5. 

The hybrid : ■ i . r < >m Haygood and Davidson 

(Haygood and Davidson, 1997). Samples were fixed in 4% 
5 i i «'< del > PBS (pi > pi tebu ed sal » pH 7.4) hated thro j 
an ethanol series to 70% and stored at 4 C C. Approximately 10 larvae were 
permeabilized with proteinase K, refrxed with 4% paraformaldehyde in PBS, and 
blocked with acetic anhydride. After washing, they were then suspended in pre- 
hybridization buffer (1 mg total RNA and 100 pg heparin per mL. 10% 

10 formamide, 5X SSC, 0.1% Tween 20 s 5 mM EDTA and IX Denhardte' solution) 
and incubated at 43°C for 1 hour. For the hybridization step, labeled probe was 
added to a concentration of 2 ng/pl, and the samples were incubated at 55°C for 
10 minutes, and then overnight {-15 hours) at 43°C. Samples were then washed 
according to the following schedule: (3 ) 2 times for 15 minutes at 43°C with 

15 high stringency solution (10% formamide, 5 * SSC, and 1% sodium dodecyl 
sulfate); (2) 2 times 15 minutes at 37 S C with wash 2 (10% formamide, 2 x SSC, 
and 1% sodium dodecyl sulfate); (3) and once at room temperature for 30 
minutes with wash 3 (10% formamide, 2 x SSC, and 0.1% Tween 20). After the 
final wash, the larvae were suspended and stored hi PBS. For analysis, the 

20 larvae were mounted in Vectashield™ and examined wife a Zeiss 510 eonfocat 
laser-scanning microscope. All excitation wavelengths (488, 568, and 647 nrn) 
were used simultaneously to help visualize the tissue surroimding any signal. 
Images were obtained and manipulated using 3-D for LSM software. 
Results 

25 Chemistry. The larval extracts were very active in the HCT bioassay 

(EC* = 0.65 pg/mL), which suggested the presence of fee eeteinaseidins. No 
ecteinascidins could be detected by J H NMR analysis of the larval extracts. 
However, analysis by ESI MS provided a parent ion of 744 which corresponds to 
Bi-743 [M4-H3\ and bombardment of that peak produced the fragment 494, 

30 which corresponds to fee loss of the C-ring subunit (Ft gure 6). 
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PGR. The use of universal abacterial primers allowed amplification of 
-1400 bp of bacterial 16s r'RNA sequences from genomic DNA isolated from 
larvae, although the yields were generally low. The amplified products were 
submitted for sequencing directly, but a clean sequence was not obtained. The 
5 -3 00 bp PGR products were subjected to DGGE analysis (Figure 7). The two 
dif&se bands were cut out and sequenced. The consensus rRNA sequence was 
detennined to be 5 from 5* to 3*, 

TTGGGTTAAGTCCCGTAACGAGCGCAACCCTlTCCCITAGrrGCCAGC 
GTGTAAA.GACGGGGACTCTGAGGGGACTGCCGGTGATA.^ACCGGAG 

10 G^GGCGAGGACGACGTCAAGTCATCATGGTCCrrACGAGTAGGGCT 
AGACACGTGCTACAATGGTATGTACAAAGGGAGGCAAAATTGTAAAA 
TCTAGCAAATCCCGAAAAGCATATCTTAGTCCGGATrGA.AGTCTGCA 
ACTCGACTTCATGAj\GTTGGAATCGCTAGTAATCGCGAATCAGCATG 
'rCGCGGTGAATACGTTCCCGGGCCTT (SEQ ID NO: 13). 

1 5 Sequence analysis showed that the two bands were almost identical, and 
different hy only four bases/gaps. 

Phylogeaetic analysis. The -345 bp sequences obtained .from the 
DGGE bands were submitted to bom BLAST (GenBank) and RDP online 
analysis programs to find the most closely related sequences. The top S matches 

20 were aligned with the larval sequences and several outgroups using 

Sequeneher™. The related sequences used in the alignment and trees were 
uncultured gamma proteobacterium clone 8-65, Baikymodiotus platifrons, 
Meihyhhacter psychrophilus, Metkylmonas sp. LW 16, clone L0 1.3.6, 
endosyrabiont of the greenhouse whitefly, Trialewodes vaporariorum, 

25 Bathymodiohts japonicus, Halomoms nitritophihts. Outgroups were Vibrio 
fischeri md Streptomyces lavendulae {Figure 8). 

FISH, in the FISH experiments, both the EUB338 and E. sertula 
Bnl2$3r probes hybridized to specific cells. Little to no signal was detected 
from the no probe control, £ palauemis 1247r and the scrambled Bncontrol 

30 probe (Figure 9). The cells marked by signal were quite large (5-1 0 urn) and 
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appeared to be ascidian cells, Higher resolution analysis was attempted using 
thick sections but the ceils could not be identified (Figure 10). 

Due to the positive result from the E. sertula probe, the specificity of the 
binding was tested using more stringent PCS. amplification. PCR products were 
5 obtained for all bryozoan samples as expected, and only from the eubacterial 
primers for the E, tin- >in jta samples (Figure 11). 

Microscopy, Both light microscopic and TEM analysis of the larvae 
suggests that there is not a large population of bacteria either within or associated 
with the host cells, as the FISH experiments appeared to suggest (Figure 12). A 
1 0 few bacteria (1-3) could occasionally be seen by TEM within the larvae, but 
external to larva! ceils. The gold-labeled anti-fluorescein antibody experiments 
failed to localize the cells associated with the original oiigo signals. 

Discussion 

.1 5 The structural similarity of the ecteinasoidins to known microbial 

metabolites provided compelling circumstantial evidence for a bacterial source 
of these compounds in the ascidian. By focusing on non-feeding E. turbinate. 
larvae, complications such as filter feeding in the adults could be avoided. The 
secondary metabolites of the larvae were determined in conj unction with 

20 genomic studies in an attempt to obtain information about the biosynthetic 
source of the ecteinascidin compounds. 

The chemical analyses showed that Et-743 is present in the larvae in high 
enough yields that it can be easily detected in crude extracts by ESI mass 
spectrometry. Although the exact location of the metabolites is not known, it is 

25 suspected that at least some arc ass >cial A with the surface of the larvae due to 
the immediate deterrent effect against fish upon uptake into the mouth 
(Bingham, 1986). 

initial PGR experiments using bacterial primers suggested that bacteria 
were associated with the larvae. FISH experiments using eubacterial probes also 

30 suggested the presence of bacterial seq ei inside or associated with 

ascidian host cells. All the control probes except for the Bnl253r (Endobugula 
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serttda) probe provided negative results. The presence of E. serttda or closely 
related species was tested by PGR amplification using E. serttda specific 
primers. No amplification was detected. 

Dae to the relatively abundant signal inferred by the ©abacterial probe, it 
5 was expected that large numbers of bacteria would be detected by TEM analysis. 
However, extremely few morphologically distinct bacteria could be visualized 

It is possible that there are bacterial symbionts associated with the 
ascidian larvae in small numbers or with only adults that produce Et~?43. There 
are some invertebrates that contain obligate symbionts as adults but do not 

10 transmit them to their offspring. For example, sepiolid squids contain symbiotic 
bacteria {Vibrio fist hen'} in their light organs that are responsible for 
bioluminescent light production, but the symbionts are not vertically transmitted 
to their eggs. Instead, the young squids selectively obtain the same symbiont 
from the ambient seawaier (Nyholm et al, 2000), Another mode of vertical 

1 5 symbiont transfer is for the adult to brood eggs aposymbiotically and then allow 
the passage of bacteria onto them as they pass specific organs into the 
environment (Giere et al, 1991). Some larvae were excised from E. mrbinata 
for this study prior to natural release and could therefore be aposymbiotic due to 
this possible mode of symbiont transfer. However, TEM and PISH experiments 

20 conducted with both "natural" larvae and dissected samples provided identical 
results in all cases. 

The ecological and biological functions of Et-743 are still largely 
unknown, although there is some evidence to suggest that they serve as a feeding 
deterrent for the larvae (Young and Bingham, 1987). Feeding experiments using 

25 adult extracts showed that they were only deterrent when the protein level of the 
test foods matched the extremely low concentration in the ascidian (Kicklighter, 
2000), Other possible functions include prevention of bacterial or viral 
infection, settlement cues, and/or prevention of encroachment of nearby species. 
Some workers have also found that the tiger fistworm Pseudoceros cromri feeds 

30 exclusively upon adult B, mrbinata and selectively sequesters some of the 
ectemascidins (Wright, 1999). 
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Example 7 

Characterization of Putative Bacteria! Symbionts of Ect&nascidia turbinate 
DNA was isolated from Ecieii > inata \FT) Ian at sto ed in 

5 proteinase k (from fee Qiagen DNeasy kit) at 4°C. From fresh ETs, numerous 
larvae were collected .tor fixing for FISH, RNA isolation, DNA isolation, and 
DAPI staining. C-sections were performed and numerous larvae in different 
developmental stages were collected and stored, as well as the adults, in ethanol. 

10 Resul ts 

9.5 (ig RNA was isolated from various batches of larvae. 
DAPI staining showed fluorescence in small round dots of consistent 
size. A second DAPI staining was performed using ethanol-preserved larvae in 
order to control for potential auto-fluorescence by using an unstained control. 

1 5 WMle no aiitofiuorescence was seen in the second DAPI, the dots were not 
present in the stained samples. 

PCR experiments were performed using 130-190 ng of larval DNA and 
the L055f/I392r primers without the GC clamp; and, because the 338 probe 
sequence gave a positive signal in FISH (Example 6), in a separate reaction the 

20 3 38f/l 392r (non-GC ciamp) primers were used, PCR conditions were identical 
to those in Example 6. The PCR products were reamplified using the 
1 055171 392r primers, with a GC clamp on the reverse primer, and several 
reactions produced strong bands. 

The products were cleaned with an ethanol precipitation, quantified, and 

25 a DGGE with a 30-70% gradient on a S% gel was performed with both a 
negative and positive (E, coli) control. As in Example 6, samples gave rise to 
two wide bands on the DGGE gel, in addition to other bands. Seventeen bands 
were removed from the DGGE gel, DNA from the bands was purified and 
reamplified with the 10S5£1392r primers. Following clean-up, either 50 ng or 

30 90 ng of DNA DNA was subjected to sequencing. Sequence of the two bands 
was, 5' to 3', 
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Band lie 

TATGGCTGTCGTTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGT 
AACGAGCGCAACCCTrrCCCTrAGTTGCCAGCGTGTAAAGACGGGGA 
CTCTGAGGGGACTGCCGGTGATAAACCGGAGGAAGGCGAGGACGAC 
5 GTCAAGTCATCATGGTCCTTACGAGTAGGGCTACACACGTGCTACAA 
TGGTATGTACCAAGGGAGGCAAMATTGTAAAATCTARCAAATCCCC 
MAAAGCATATCTTAGTCCGGATTGAAGTCTGCAACTCGACTTCATGA 
AGTTGGAATCGCTAGTAATCGCGAATCAGCTATGTCGCGGTGAATAC 
GTTCCCGGGCCTTGTACACACCG (SEQ ID NO: 1 4); and 

10 

Band lib 

TTAGTTGCAGCGTGTAAGACGGGGACTCTGAGGGGACTGCCGGTGAT 
AAACCGGAGGAAGGCGAGGACGACGTCAAGTCATCATGGTCC1TAC 
GAGTAGGGCTACACACGTGCTACAATGGTATGTACAAAGGGAGGCA 
15 AiiATTGTAAjf^ATCTAGCAAATCCCCAAAAGCATATCTTAGTCCGGAT 
TGAAGTCTGCAACTCGACTTCATGAAGTTGGA\TCGCTAGTA\TCGC 
GAATCAGCATGTCGCGGTGAATACGTTCCCGGGCCTTGTACACAC 
(SEQ ID NO: 15). 

20 Die consensus sequence from these bands was exactly as that of 

Example 6. Using this data, several phylogenetic trees using both distance and 
parsimony methods were constructed, and in all cases, the DGGE sequences 
were most closely related to two syraMonts of the winterly as described m 
Example 6, The sequences were also related to several Haiomonas species, 

25 

All publications, including patent documents and scientific articles, 
referred to in ibis application, including any bibliography, are incorporated by 
reference in tbeir entirety for all purposes to the same extent as if each individual 
publication were individually incorporated by reference. 

30 
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WHAT IS CLAIMED IS: 

1 . A composition, comprising a bacteria isolated from Ecteinascidia 
twbinata. 

5 

2. The composition of claim 1, wherein said bacteria comprises at least one 
activity that is involved in the production of an ectem&scidin or a precursor 
thereof. 

10 3 . A composition, comprising: at least one nucleic acid molecule that 

encodes at least one polypeptide that catalyzes at least one step in the synthesis 
of at least one ectemascidin, wherein said at least one nucleic acid is derived 
from at least one marine organism. 

15 4, The composition of claim 3, wherein said at least one polypeptide 
comprises at least one activity of at least one non-rihosomal peptide synthase, 

5. The composition of claim 3, wherein said at least one marine organism 
comprises at least one bacteria. 

20 

6. The composition of claim 3, wherein said at least one marine organism 
comprises at least one invertebrate. 

7. The composition of claim 6, wherein said at least one invertebrate 
25 comprises at least one Ecteinascidia. 

8. The composition of claim 7, wherein said at least one Ecteinascidia is 
Ecteinascidia turbinate. 

30 9. The composition of claim 3, wherein said at least one nucleic acid 
molecule further comprises at least one expression control sequence. 
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1 0. The composition of claim 3 } wherein said nucleic acid molecule is in a 
vector, 

1 1 . The composition of claim 10, wherein said vector Is within a cell. 

5 

12. The composition of claim 3, wherein said at least one nucleic acid, 
molecule is within a ceil. 

13. A composition, comprising a library of nucleic acid molecules of claim 3. 

10 

14. A composition, comprising: at least one polypeptide that catalyzes at 
least one step in the synthesis of at least one eetemascidra, wherein said at least 
one polypeptide is derived from at least one marine organism. 

15 15. The composition of claim 14, wherein said at least one ecteinascidia 
comprises Et-743. 

1 6. The composition of claim 14, wherein said at least one marine organism 
comprises at least one bacteria. 

20 

17. The composition of claim 14, wherein said at least one marine organism 
comprises at least one invertebrate. 

1 8. The composition of claim 1 7, wherein said at least one invertebrate 
25 comprises at least one Ecteinascidia, 

19. The composition of claim 1 8 , wherein said at least one Ecteinascidia is 
Ecteinascidia turbinate. 

30 20. The composition of claim 1 4 5 wherein said at least one polypeptide is 
within a cell. 
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21. A composition, comprising a library of polypeptides of claim 14, 

22. A composition comprising an antibody or an active fragment thereof that 
binds with or specifically binds with a bacteria of claim L 

5 

23 . A composition comprising an antibody or an active fragment thereof that 
binds with or specifically binds with a polypeptide of claim 14. 

24. A method of making a composition, comprising: providing at least one 
1 D composition of claim i , and synthesizing at least one ecteinascidin. 

25 . A composition made by the method of claim 24. 

26. The composition of claim 25, wherein said composition does not 
1 5 comprise a known ecteinascidin. 

27. The composition of claim 24, comprising at least one pharmaceutically 
acceptable earner. 

20 28 . The composition of claim 24, wherein said composition is a 
pharmaceutical composition. 

29. A method of making a composition, comprising: providing at least one 
composition of claim 3, and synthesizing at least one ecteinascidin. 

25 

30. A composition made by the method of claim 29. 

3 1 . The composition of claim 30, wherein said composition does not 
comprise a known bryostatjm 

30- 
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32, The composition of claim 30, comprising at least one pharmaceutic ally 
acceptable carrier. 

33, The composition of claim 30, wherein said composition is a 
5 phmmaceutical composition. 

34, A method of making a composition, comprising: providing at least one 
composition of claim 14, and synthesizing at least one ecteinascidin. 

10 35. A composition made fay the method of claim. 34. 

36. The composition of claim 35, wherein said composition does not 
comprise a known ecteinascidin. 

1 5 37, The composition of claim 35., comprising at least one pharmaceuticaliy 
acceptable carrier. 

38. The composition of claim 35, wherein said composition is a 
pharmaceutical composition. 

20 

39. A method for identifying at least one nucleic acid molecule encoding at 
least one polypeptide that catalyzes at least one step in the synthesis of an 
ecteinascidin , comprising: contacting a nucleic acid molecule of claim 1 with, a 
sample, and identifying nucleic acid molecules in said sample that hybridize with 

25 said nucleic acid molecule of claim 1 , 

40. Hie method of claim 39, wherein said sample is derived at least in part 
from an environmental sample. 

30 41 . The method of claim 40, wherein said environmental sample is derived at 
least in part from a marine environment. 
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42. A nucleic acid molecule identified by the method of claim 39, 

43. The nucleic acid molecule of claim 42, comprising an expression control 
sequence. 

5 

44. The nucleic acid molecule of claim 42 in a vector. 

45 . The nucleic, aeid molecule of claim 42 in a cell. 

10 46. A composition comprising a library of nucleic add molecules of 
claim 42. 

47. A method for identifying a bioactive compound, comprising: contacting a 
composition of claim 1 with an in vitro, ex vivo or in vivo detection system and 

1 5 determining the hioaetivity of said compound. 

48. A bioactiv e compound identified by the method of el aim 47, 

49. The bioactive compound of claim 48 in a pharmaceutical^ acceptable 
20 carrier. 

50. The bioactive compound of claim 48, wherein said bioactive compound 
is a pharmaceutical compound. 

25 51, A method for identifying a bioactive compound, comprising: contacting a 
composition of claim 3 with an in vitro, ex vivo or in vivo detection system and 
determining the hioaetivity of said compound. 

52 . A bioactive compound identified by die method of claim 5 i . 

30 
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53. The bioactive compound of claim 52 m a piiarmacentically acceptable 
carrier. 

54. The bioactive compound of claim 52, wherein said bioactive compound 
5 is a pharmaceutical compound. 

55. A method for identifying a bioactive compound, comprising: contacting a 
composition of claim 14 with an in vitro, ex vivo or in vivo detection system and 
determining the bioactivity of said compound, 

10 

56. A bioactive compound identified by the method of claim 55, 

57. The bioactive compound of claim 56 in a pharaiaceiitically acceptable 
carrier. 

15 

5 S. The bioactive compound of claim 56, wherein said bioactive compound 
is a pharmaceutical compound. 
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international preliminary examination (Rule 66.1(e) PCT). The applicant 
1s advised that the Z?Q policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case Irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 
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